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H ■ ABSTRACT 

In addition to optical photometry of unprecedented quality, the Sloan Digital Sky Survey (SDSS) 

is producing a massive spectroscopic database which already contains over 280,000 stellar spectra. 

.^ . Using effective temperature and metallicity derived from SDSS spectra for ~60,000 F and G type 

rsi ' main sequence stars (0.2 < g — r < 0.6), we develop polynomial models, reminiscent of traditional 

I methods based on the UBV photometry, for estimating these parameters from the SDSS u—g and g~r 

r~~i, colors. These estimators reproduce SDSS spectroscopic parameters with a root-mean-square scatter 

i-^ ' of 100 K for effective temperature, and 0.2 dex for metallicity (limited by photometric errors), which 

*Y^] are similar to random and systematic uncertainties in spectroscopic determinations. We apply this 

Q , method to a photometric catalog of coadded SDSS observations and study the photometric metallicity 

H ' distribution of ^200,000 F and G type stars observed in 300 deg^ of high Galactic latitude sky. 

c/3 I These deeper {g < 20.5) and photometrically precise ('--^O.Ol mag) coadded data enable an accurate 

C^ . measurement of the unbiased metallicity distribution for a complete volume-limited sample of stars 

at distances between 500 pc and 8 kpc. The metallicity distribution can be exquisitely modeled 

»vj ! using two components with a spatially varying number ratio, that correspond to disk and halo. The 

^ i best-fit number ratio of the two components is consistent with that implied by the decomposition of 

(^ ' stellar counts profiles into exponential disk and power-law halo components by Juric et al. (2008). 

l/^ I The two components also possess the kinematics expected for disk and halo stars. The metallicity 

00 ' of the halo component can be modeled as a spatially invariant Gaussian distribution with a mean 

C^ ' of [Fe/H] = —1.46 and a standard deviation of ^0.3 dex. The disk metallicity distribution is non- 

l' , Gaussian, with a remarkably small scatter (rms'^0.16 dex) and the median smoothly decreasing with 

^— ^ ■ distance from the plane from —0.6 at 500 pc to —0.8 beyond several kpc. Similarly, we find using 

QQ ' proper motion measurements that a non-Gaussian rotational velocity distribution of disk stars shifts by 

f~^ , ~50 km/s as the distance from the plane increases from 500 pc to several kpc. Despite this similarity, 

H^' ■ the metallicity and rotational velocity distributions of disk stars are not correlated (Kendall's r — 

0.017 ± 0.018). This absence of a correlation between metallicity and kinematics for disk stars is in a 

conflict with the traditional decomposition in terms of thin and thick disks, which predicts a strong 

correlation (r = —0.30 ±0.04) at ^1 kpc from the mid-plane. Instead, the variation of the metallicity 

C^ ' and rotational velocity distributions can be modeled using non-Gaussian functions that retain their 

shapes and only shift as the distance from the mid-plane increases. We also study the metallicity 

distribution using a shallower (g < 19.5) but much larger sample of close to three million stars in 

8500 sq. deg. of sky included in SDSS Data Release 6. The large sky coverage enables the detection 

of coherent substructures in the kincmatics-metallicity space, such as the Monoceros stream, which 

rotates faster than the LSR, and has a median metallicity of [Fe/H] = —0.95, with an rms scatter 

of only ~0.15 dex. We extrapolate our results to the performance expected from the Large Synoptic 

Survey Telescope (LSST) and estimate that the LSST will obtain metallicity measurements accurate 

to 0.2 dex or better, with proper motion measurements accurate to ~0.2-0.5 mas/yr, for about 200 

million F/G dwarf stars within a distance limit of ^100 kpc {g < 23.5). 

Subject headings: methods: data analysis — stars: statistics — Galaxy: halo, kinematics and dy- 
namics, stellar content, structure 
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1. INTRODUCTION 

A major objective of modern astrophysics is to under- 
stand when and how galaxies formed, and how they have 
evolved since then. Our own galaxy, the Milky Way, 
provides a unique opportunity to study a galaxy in great 
detail by measuring and analyzing the properties of a 
large number of individual stars. 

The formation of galaxies like the Milky Way was long 
thought to be a steady process leading to a smooth dis- 
tribution of stars, with this standard view exemplified by 
the Bahcall & Soneira (1980) and Gilmore, Wyse & Kui- 
jken (1989) models, and described in detail by e.g. Ma- 
jewski (1993). In these smooth models, the spatial distri- 
bution of stars in the Milky Way is usually^'* modeled by 
three discrete components described using relatively sim- 
ple analytic expressions: the thin disk, the thick disk, and 
the halo. However, recent discoveries of complex sub- 
structure in the distribution and kinematics of the Milky 
Way's stars (e.g. Ivezic et al. 2000; Yanny et al. 2000; 
Vivas et al. 2001; Newberg et al. 2002; Gilmore, Wyse & 
Norris 2002; Majewski et al. 2003; Duffau, Zinn & Vivas 
2006; Vivas & Zinn 2006; Grillmair 2006ab; Belokurov et 
al. 2006, 2007; BeU et al. 2007; Juric et al. 2008) have 
deeply shaken this standard view. Unlike those smooth 
models that involve simple components, the new data in- 
dicate many irregular structures, such as the Sgr dwarf 
tidal stream in the halo and the Monoceros stream closer 
to the Galactic plane. These recent developments, based 
on accurate large-area surveys, have made it abundantly 
clear that the Milky Way is a complex and dynamical 
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structure that is still being shaped by the infall (merg- 
ing) of neighboring smaller galaxies. 

Numerical simulations suggest that this merger process 
plays a crucial role in setting the structure and motions 
of stars within galaxies, and is a generic feature of current 
cosmological models (e.g., Helmi et al. 1999; Springel & 
Hernquist 2003; Bullock & Johnston 2005). Since the 
individual stars that make up the stellar populations in 
the Milky Way can be studied in great detail, their char- 
acterization will provide clues about the galaxy merging 
process that cannot be extracted from observations of 
distant galaxies (e.g., Abadi et al. 2003; Brook et al. 
2004; and references therein). 

The three presumably discrete Milky Way components 
differ not only in their spatial profiles, but also in the 
detailed distributions of their kinematics and metallic- 
ity (e.g., Majewski 1993; Ojha et al. 1996; Freeman & 
Bland-Hawthorn 2002; Robin et al. 2003; Wyse 2006; 
and references therein). The thin disk, with a scale 
height of ^300 pc, has a vertical velocity dispersion of 
CTz ^ 20 km s^^, while the thick disk, with a scale height 
of ^1000 pc, is somewhat warmer (cr^ ^ 40 km s^^) 
and older, has a lower average metallicity {[Z/Zq] ~ 
—0.7, e.g., Gilmore & Wyse 1985), and has enhanced 
Qf-element abundances (e.g., Fuhrmann 2004; Bensby et 
al. 2004; Feltzing 2006; Reddy et al. 2006; Ramirez 
et al. 2007). In contrast, the halo is composed mainly 
of low- metallicity stars {[Z/Zq] < —1.0, e.g., Ryan & 
Norris 1991), and has little or no net rotation. Hence, in 
addition to their spatial profiles, the main differences be- 
tween these components are in their rotational velocity 
distributions, velocity dispersions, and metallicity distri- 
butions. 

We note that a recent study by CaroUo et al. (2007), 
based on a sample of over 20,000 calibration stars with 
available spectra from SDSS Data Release 5, has demon- 
strated that "the halo" of the Galaxy is likely to comprise 
two distinct components. According to these authors, 
the inner-halo component dominates the population of 
halo stars found at distances up to 10-15 kpc from the 
Galactic center (including the Solar neighborhood), and 
an outer-halo component dominates in the regions be- 
yond 15-20 kpc. The inner halo stars are non-spherically 
distributed about the center of the Galaxy, with an in- 
ferred axes ratio of ~0.6, while the outer halo comprises 
stars that exhibit a much more spherical spatial distribu- 
tion. Our present study only reaches to 8 kpc from the 
Sun, and hence is likely to be dominated by inner-halo 
stars. Therefore, for the purpose of the present paper, 
we assume a single-component halo. 

Despite the significant progress that has been made 
over the years, we still cannot answer some simple ques- 
tions such as: Are the exponential profiles used to de- 
scribe the spatial profiles of thin and thick disks an over- 
simplification? Why do estimates for thick disk scale 
height differ by a factor of several between different stud- 
ies (for a discussion see Siegel et al. 2002 and Juric et 
al. 2008)? Is the transition between thin and thick disks 
in metallicity and kinematics abrupt or continuous? Is 
there a large-scale metallicity gradient in the thick disk 
and halo? Does the disk scale length depend on metal- 
licity? Can large spatial substructures be traced in kine- 
matic and metallicity spaces? 

To reliably answer these and similar questions, a data 



set needs to be both voluminous (to enable sufficient spa- 
tial, kinematic and metallicity resolution) , diverse (accu- 
rate distance and metallicity estimates, as well as radial 
velocity and proper motion measurements are required), 
and faint (to probe a significant fraction of the Galaxy). 
Modern sky surveys, such as the Sloan Digital Sky Sur- 
vey (hereafter SDSS, York et al. 2000), with its imaging 
and spectroscopic components, and the Two Micron All 
Sky Survey (Skrutskie et al. 2006) with its all-sky cov- 
erage, have recently provided such data sets. 

Most studies of the Milky Way structure can be de- 
scribed as investigations of the stellar distribution in 
the nine-dimensional space spanned by the three spa- 
tial coordinates, three velocity components, and three 
main stellar parameters (luminosity, effective tempera- 
ture, and metallicity). Depending on the quality, diver- 
sity and quantity of data, such studies typically concen- 
trate on only a hmited region of this space (e.g., the 
nearby solar neighborhood, pencil beam surveys, kine- 
matically biased surveys), or consider only marginal dis- 
tributions of selected quantities (e.g., number density of 
stars irrespective of their metallicity or kinematics, lu- 
minosity function determinations, proper motion surveys 
without metallicity or radial velocity information). We 
use the SDSS data to study in detail the stellar distri- 
bution in this multi-dimensional space. We focus on the 
metallicity distribution of disk and halo stars in this con- 
tribution. In companion papers we discuss the spatial 
distribution of stars (Juric et al. 2008, hereafter JOS) 
and their kinematics (Bond et al. 2008, in prep., here- 
after B08). 

In §2, we use the data for ^--^60,000 probable F and G 
type main sequence stars provided by the SDSS spec- 
troscopic survey to calibrate a method for estimating 
metallicity from the u — g and g — r colors measured 
by the SDSS photometric survey. Readers who are not 
interested in technical aspects of this method may want 
to skip directly to §3, where we apply this method to 
two photometric catalogs constructed using SDSS data. 
One catalog contains averaged repeated observations, 
and provides sufficiently improved photometric accuracy 
and depth to study the metallicity distribution all the 
way to the disk-halo interface at several kpc from the 
Galactic plane. The second catalog, based on all SDSS 
photometric observations to date, covers a wide area and 
probes a significant fraction of the Galaxy. We summa- 
rize and discuss our results in 54. 



2. DETERMINATION OF STELLAR METALLICITY FROM 
SDSS PHOTOMETRIC DATA 

The most accurate measurements of stellar metallic- 
ity are based on spectroscopic observations. Despite the 
recent progress in the availability of stellar spectra (e.g., 
SDSS has recently made publicly available^^ over 280,000 
stellar spectra as a part of its Data Release 6; the pro- 
posed extension of SDSS, known as SDSS-III, is capa- 
ble of providing another several hundred thousand stars 
with available spectra in the next few years; RAVE^^ 
may provide up to a million spectra, primarily thin- and 
thick-disk stars, over the next few years), the number of 
stars detected in imaging surveys is vastly larger. In ad- 

^^ See http: //www.sdss.org/dr6 

■^^ See|tLttp://www.rave-siirvey.aip.de/rave| 



dition to generally providing better sky and depth cover- 
age than spectroscopic surveys, imaging surveys obtain 
essentially complete flux-limited samples of stars. The 
simple selection criteria used for the photometric surveys 
are advantageous when studying Galactic structure, com- 
pared to the complex targeting criteria that are used for 
SDSS stellar spectra (see 5 l2.2l below1. Hence, we use the 
extant SDSS spectroscopic data to calibrate a method for 
estimating metallicity from the SDSS imaging data, and 
use this calibration to study the metallicity distribution 
of several million disk and halo stars of the Milky Way. 
Stellar metallicity has long been estimated using pho- 
tometric methods such as the traditional UV excess 
based 5{U — -B)o.6 method (Wallerstein 1962; Sandage 
1969). A blue main sequence (F and G type) star's 
metallicity is correlated with the difference between the 
star's U — B color and that which would be measured 
for a metal-rich star with the same B — V color. This 
correlation is seen in both data (e.g. Carney 1979 and 
references therein) and detailed stellar models (Kurucz 
1979). The Johnson UBV bands are similar to SDSS's 
ugr bands and thus it should be possible to derive an 
analogous method applicable to SDSS photometric sys- 
tem, as recently attempted by Karaali, Bilir & Tuncel 
(2005). However, as they pointed out, their study did not 
utilize SDSS data, but a somewhat different photometric 
system. Unfortunately, even small photometric offsets 
and color terms between different photometric systems 
may have significant systematic effects on derived metal- 
licities. For example, the SDSS u-band measurements 
are offset from the AB system by ^0.04 mag (Eisenstein 
et al. 2006; Holberg & Bergeron 2006), leading to a 
metallicity bias of up to 0.2 dex. Here we derive photo- 
metric metallicity estimators for the SDSS filter set using 
SDSS Data Release 6 data. This calibration relies on the 
large number of stars (^287,000) with a homogeneous 
set of stellar parameters (effective temperature, metallic- 
ity and gravity) derived from moderate-resolution SDSS 
spectra (Beers et al. 2006; AUende Prieto et al. 2006; 
Lee et al. 2007ab; Allende Prieto et al. 2007). 



2.1. An Overview of the Sloan Digital Sky Survey 

The SDSS is a digital photometric and spectroscopic 
survey which covers about one quarter of the Celestial 
Sphere in the North Galactic cap, as well as a smaller 
area (~300 deg^) but much deeper survey in the Southern 
Galactic hemisphere (Stoughton et al. 2002; Abazajian 
et al. 2003, 2004, 2005; Adclman-McCarthy et al. 2006). 
SDSS is using a dedicated 2.5m telescope (Gunn et al. 
2006) to provide homogeneous and deep (r < 22.5) pho- 
tometry in five bandpasses (Fukugita et al. 1996; Gunn 
et al. 1998; Smith et al. 2002; Hogg et al. 2002; Tucker 
et al. 2006) repeatable to 0.02 mag (root-mean-square 
scatter, hereafter rms, for sources not limited by photon 
statistics, Ivezic et al. 2003) and with a zeropoint uncer- 
tainty of -0.02-0.03 mag (Ivezic et al. 2004). The flux 
densities of detected objects are measured almost simul- 
taneously in five bands (m, g, r, «, and z) with effective 
wavelengths of 3540 A, 4760 A, 6280 A, 7690 A, and 9250 
A. The large survey sky coverage will result in photomet- 
ric measurements for well over 100 million stars and a 



similar number of galaxies^^. The completeness of SDSS 
catalogs for point sources is ^99.3% at the bright end 
and drops to 95% at magnitudes of 22.1, 22.4, 22.1, 21.2, 
and 20.3 in u, g, r, i and z, respectively. Astrometric 
positions are accurate to better than 0.1 arcsec per coor- 
dinate (rms) for sources with r < 20.5 (Pier et al. 2003), 
and the morphological information from the images al- 
lows reliable star-galaxy separation to r ~ 21.5 (Lupton 
et al. 2002; Scranton et al. 2002). A compendium of 
other technical details about SDSS can be found on the 
SDSS web site (http://www.sdss.orgj , which also pro- 
vides interface for the puETic data access. 

2.2. SDSS spectroscopic survey of stars 

SDSS spectra are obtained with a pair of dual multi- 
object fiber-fed spectrographs on the same telescope used 
for the imaging survey (Uomoto et al., in prep). Spec- 
troscopic plates have a radius of 1.49 degrees and take 
640 simultaneous spectra, each with wavelength coverage 
3800-9200 A and spectral resolution of i? - 2000. The 
signal-to-noise ratio is typically >4 per pixel at 5=20, but 
is substantially higher for brighter point sources, such as 
considered herein. 

Targets for the spectroscopic survey are chosen from 
the SDSS imaging data, described above, based on their 
colors and morphological properties^*. The targets in- 
clude 

• Galaxies: a simple flux limit for "main" galaxies, 
flux-color cut for luminous red galaxies (Strauss et 
al. 2002; Eisenstein et al. 2001) 

• Quasars: flux-color cut, matches to FIRST survey 
(Richards et al. 2002) 

• Non-tiled objects (color-selected): calibration 
stars (16 per plate), "interesting" stars (hot white 
dwarfs, brown dwarfs, red dwarfs, red giants, blue 
horizontal branch stars, carbon stars, cataclysmic 
variables, central stars of planetary nebulae), sky 

Here, (non)-tiled objects refers to objects that are not 
guaranteed a fiber assignment. As an illustration of the 
fiber assignments, SDSS Data Release 6 contains spectra 
of 791,000 galaxies, 104,000 quasars, and 287,000 stars. 
The spectra are targeted and automatically processed 
by three pipelines: 

• target: Target selection and tiling 

• spectro2d: Extraction of spectra, sky subtrac- 
tion, wavelength and flux calibration, combination 
of multiple exposures 

• spectrold: Object classification, redshifts deter- 
mination, measurement of line strengths and line 
indices 



For each object in the spectroscopic survey, a spec- 
tral type, redshift (or radial velocity), and redshift error 
is determined by matching the measured spectrum to a 
set of templates. The stellar templates are calibrated 
using the ELODIE stellar library. Random errors for 
the radial velocity measurements are a strong function 
of spectral type and signal-to-noise ratio, but are usu- 
ally < 5 km s""'^ for stars brighter than g ~ 18, rising 
sharply to ~25 km s^^ for stars with g — 20. Using a 
sample of multiply-observed stars, Pourbaix et al. (2005) 
estimate that these errors may be underestimated by a 
factor of ~1.5. Further technical details about SDSS 
spectroscopic survey are available from www.sdss.org. 



2.3. Stellar Atmospheric Parameter Estimation 

SDSS stellar spectra are of sufficient quality to provide 
robust and accurate stellar parameters, such as effective 
temperature, surface gravity, and metallicity (parame- 
terized as [Fe/H]). These parameters are estimated us- 
ing a variety of methods implemented in an automated 
pipehne (the SEGUE Stellar Parameters Pipeline, SSPP; 
Beers et al. 2006). A detailed discussion of these meth- 
ods and their performance can be found in AUende Prieto 
et al. (2006, 2007) and Lee et al. (2007a,b). Based on 
a comparison with high-resolution abundance determi- 
nations, they demonstrate that the combination of spec- 
troscopy and photometry from SDSS is capable of de- 
livering estimates of T^s, log{g), and [Fe/H] with ex- 
ternal accuracies of 190 K (3.2 %), 0.28 dex, and 0.17 
dex, respectively. These tests indicate that mean sys- 
tematic errors for [Fe/H] and Toff should not be larger 
than about 0.2 dex and 100 K, and may be below 0.1 dex 
and 75 K (Lee et al. 2007b). Note that these estimates 
apply to stars with a wider range of temperatures than 
we consider in this study. 

We use the final adopted values, called teffa and feha 
in the SDSS sppParams table, which are based on averag- 
ing several different methods. A detailed analysis by Lee 
et al. (2007a, b) demonstrates that systematic metallicity 
differences between the methods used in averaging do not 
exceed ~0.1 dex. A comparison with Galactic open and 
globular clusters indicates that the adopted metallicity 
scale systematically overestimates metallicity by ~0.15 
dex for [Fe/H] < — 2 and underestimates metallicity by 
up to ^0.3 dex for stars near Solar metallicity (the metal- 
licity offsets have been improved recently, and are now 
essentially nil, but for the purpose of this paper, we have 
made use of a previous version of the SSPP, hence the 
sytematics remain present). 

Only a few percent of stars in SDSS spectroscopic sam- 
ple are giants. For this reason, we consider only the main 
sequence stars, using the selection criteria described be- 
low. Although we address photometric estimates of ef- 
fective temperature, the main goal of this section is to 
derive a robust and accurate photometric metallicity es- 
timator. 



^'^ The recent Data Release 6 lists photometric data for 287 
million unique objects observed in 9583 deg^ of sky; Adclman- 
McCarthy et al. 2008; see http://www.sdss.org/dr6/ 

^* The recent extension of SDSS survey, known as SDSS-II, has 
different targeting priorities. In particular, the sub-survey known 
as SEGUE (Sloan Extension for Galactic Understanding and Ex- 
ploration), is optimized for Galactic structure studies. 



2.3.1. Sample Selection 

We begin by selecting bright stars from the main stel- 
lar locus (Lcnz et al. 1998; Fan 1999;, Finlator et al. 
2000; Smolcic et al. 2004), with colors located in the 
proper range for the application of photometric metallic- 



median metolliclty 




-0.1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 

g r 

Fig. 1. — The linearly-spaced contours show the distribution 
of ~110,000 stars with g < 19.5 and -0.1 < g - r < 0.9 (corre- 
sponding to effective temperatures in the range 4500 K to 8200 K) 
from the SDSS DR6 spectroscopic sample in the log(g) vs. g — r 
plane. The multi-modal distribution is a result of SDSS target se- 
lection algorithm. The color scheme shows the median metallicity 
in all 0.02 mag by 0.06 dex large pixels that contain at least 10 
stars. The fraction of stars with log(g)<3 (giants) is 4%, and they 
are mostly found in two color regions: —0.1 < g — r < 0.2 (BHB 
stars) and 0.4 < g — r < 0.65 (red giants). They are dominated 
by low-metallicity stars {[Fe/H] < —1). The dashed lines outline 
the main-sequence (MS) region selected for deriving photometric 
estimates for effective temperature and metallicity. 



ity method (roughly^^ 0A< B-V < 0.8, Carney 1979), 
and from sky regions with modest interstellar dust ex- 
tinction (SDSS utilizes the Schlegel, Finkbeiner & Davis 
1998 maps). The specific criteria applied to 130,620 
entries from the so-called sppParams table"^*^ that have 
log{g) > are 

1. The interstellar extinction in the r band below 0.3: 
[106,816] 



2. 14 < g < 19.5: [104,844] 

3. 0.2 <{g-r) < 0.6: [75,928] 

4. 0.7 <{u- g) < 2.0 and -0.25 < {g 
g) < 0.05: [68,306] 



r) -0.5(u 



5. -0.2 < 0.35(5 -r)-{r-i) < 0.10: [66,496] 

The numbers in brackets indicates the number of stars 
left after each selection step. 

Using a photometric parallax relation based on obser- 
vations of globular clusters (see Appendix A for a de- 
tailed discussion), 

Mr{g - z, [Fe/H]) = M°{g - z) + AMr{[Fe/H]), (1) 

where AMr{[Fe/H]) and M^{g~i) are given by eqs.[A2] 
and IA71 respectively, we further limit the sample to 
61,861 stars in the 1-10 kpc distance range. Due to the 



small r — i color range spanned by F/G stars, when com- 
paring results to J08 it is better to estimate the r — i 
color from the better measured <? — i color using a stellar 
locus relation'^ ^ 



g-r = 1.39(1 
-2Ab{r - if - l.i 



exp[-4.9(r- if 
Kr-i)- 0.050]). 



(2) 



The selected stars span the 5000-7000 K temperature 
range (with a median of 5,900 K) and 99.4% have metal- 
licity in the range —3 to (with a median of —1.0). While 
the sample is dominated by main sequence stars (the 
median log(g) is 4.1, with an rms scatter of 0.44 dex), 
a small fraction of stars have gravity estimates consis- 
tent with giants (see Figure [T|). We exclude ~3% of 
stars with log(g)<3 (which typically have lower metal- 
licity than dwarfs, with a median [Fe/H] = —1.5; see 
Figure[T]), resulting in a final calibration sample of 59,789 
stars. This fraction of giants is relatively high because 
the g — r r^ 0.5 color range, where the fraction of gi- 
ants is the highest, was deliberately targeted for SDSS 
spectroscopy; about 7% of stars in the subsample with 
0.4 < 5 - r < 0.6 have log(g)<3. 

2.3.2. Effective Temperature 

The dependence of the median effective temperature 
and metallicity on the position in the g — r vs. u — g 
color-color diagram for the final sample of 59,789 stars is 
shown in Figure [D The top left panel demonstrates that 
the effective temperature, Tes, can be determined from 
the g — r color alone, with a negligible dependence on the 
u — g color (the gradient of log(reff) with respect to the 
g — r color is at least ^ 60 times as large as the gradient 
with respect to the u — g color). This difference in gra- 
dients is due to a general insensitivity to metallicity of 
relationships between effective temperature and colors at 
wavelengths longer than 0.4 ^m (Sandage & Smith 1963; 
Mannery & Wallerstein 1971). The best-fit expression. 



log(roff/K) = 3.872 - 0.264 {g - r) 



(3) 



reproduces SDSS spectroscopic temperature for 59,789 
main sequence stars selected from the 0.2 < g — r < 0.6 
color range with an rms scatter of 0.007 dex (correspond- 
ing to '^lOO K). When residuals are binned in 0.01 mag 
wide g — r bins, the largest median residual is 0.003 dex 
(~40 K at the median temperature of 5900 K), demon- 
strating that a linear fit is sufhcient. When residuals are 
binned in 0.1 dex wide metallicity bins, the largest me- 
dian residual is also 0.003 dex. There is no discernible 
dependence of residuals on metallicity for stars with 
[Fe/H] < -1, while for stars with -1 < [Fe/H] < -0.5 
a gradient of logiT) is 0.008 per decade of metallicity 
(dex per dex) is present. 

This behavior is consistent with results based on 
temperatures derived with the infrared flux method 
(Ramirez & Melendez 2005; Casagrande, Portinari & 
Flynn 2006, hereafter CPF). For example, the expression 
for effective temperature as a function oi B — V color and 
metallicity from CPF predicts an effective temperature 



^^ At the ~0.05 mag accuracy level, B — y=0.949(g — r)-|-0.20; ^^ JOS uses a maximum hkehhood projection on the mean stellar 

for more accurate (<0.01 mag) transformations see Ivezic et al. locus, which avoids this problem. At the bright end that is relevant 

(2006a). h ere, the two methods produce essentially the same results; we 

'^'^ Available from|http://www.sdss.org/dr6/products/spectra/spectrop^tedefaa'sthit)mimpler one. 
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Fig. 2. — The correlation of spectroscopic effective temperature (top left) and metallicity (top right) with the position of a star in the 
g — r vs. u — g color-color diagram. The color scheme shows the median values in all 0.02 mag by 0.02 mag large pixels that contain at 
least 10 stars. The distribution of stars in an imaging sample with g < 19.5 is shown by linearly-spaced contours. The dots show the 
synthetic colors for the Pickles (1998) solar metallicity standards (FO, F2, F5, F6, F8, and GO, from bottom to top), taken from Covey et 
al. (2007). The triangular region marked "TOO FAINT" in the top right panel contains no stars, due to the g < 19.5 flux limit and the 
fact that low-metallicity stars are generally more distant and fainter than high-metallicity stars. The bottom left panel shows the median 
residuals between the spectroscopic metallicity and photometric estimates based on eq. |4] Their root-mean-square scatter (over all pixels) 
is 0.06 dex. The bottom right panel shows a map of the root-mean scatter of metallicity for individual stars in each pixel. Its median value 
is 0.21 dex. The scatter is larger for weak-lined low-metallicity stars (~0.3 dex) than for high-metallicity stars (~0.15 dex), as expected. 



of 5700 Kioi- B-V = 0.6 {g - r=0.425) and metallic- 
ity of —1.0, with the latter corresponding to the median 
metallicity of stars in the SDSS spectroscopic sample. 
The effective temperature predicted by eq. [3] is 5750 K 
(a discrepancy of 0.004 dex), and the median spectro- 
scopic temperature for stars with 0.42 < g — r < 0.43 
is 5730 K. We note that both the CPF relation and 
Ramirez & Melendez (see Figs. 1 and 10) predict a 
steeper dependence of effective temperature on metallic- 
ity: a,t B — V = 0.6 the predicted effective temperature 
increases by 180 K as metallicity increases from —1.5 to 
—0.5, while in the SDSS spectroscopic sample the cor- 
responding temperature increase is 50 K. Discrepancies 
with the expression proposed by Sekiguchi & Fukugita 
(2000) are somewhat larger. Their effective tempera- 
ture scale is cooler by ~130 K than the SDSS scale, 
and log(Tcff) residuals are correlated with metallicity and 
log(g) with gradients of about 0.01 dex per dex. Further 
details about the behavior of photometric temperature 
estimator are discussed in Appendix B. 



2.3.3. Metallicity 

As first suggested by Schwarzschild, Searle, & Howard 
(1955), the depletion of metals in a stellar atmosphere 
has a detectable effect on the emergent flux, in partic- 
ular in the blue region where the density of metallicity 
absorption lines is highest (Beers & Christlieb 2005, and 
references therein). The median metallicity of stars se- 
lected from the SDSS spectroscopic sample as a function 
of the u — g and g — r colors shows a complex behavior 
that is consistent with expectations: the detailed depen- 
dence of the UV excess (i.e., the u—g color) on metallicity 
varies with effective temperature (i.e., the g — r color). 
Even when the g — r vs. u — g plane is separated by 
g — r — 0.4 into two regions suggested by the metallicity 
map, at least second-order polynomials, or several piece- 
wise linear fits, are required to avoid systematic errors 
larger than 0.1 dex. In order to do so for the entire map 
with a single function, we find that third-order terms are 
required, and model the map as: 

[Fe/H]ph ^A + Bx + Cy + Dxy + Ex^ + Fy^ (4) 



+Gx^y + Hxy^ + Ix'^ + Jy^ 



where 



TABLE 1 
Sample Distance Limits 



X = (u — g) for {g — r) < 0.4, and x = {u — g) ~ 
2(g -r) + 0.8 for {g-r)> 0.4 (this dual definition 
is required to describe the map with a single set of 
coefficients, A~J) 



V = ig-r), and 

(A-J) = (-4.37, -8.56, f5.5, -39.0, 
f2.1, 7.33, -fO.l, -21.4). 



23.5, 20.5, 



The above expression describes the median metallicity 
map shown in the top right panel in Figure [2] with a 
root-mean-square (rms) scatter of 0.09 dex. This level 
of systematic calibration errors is negligible compared to 
random errors per star ('^ 0.2 dex, due to photometric 
errors), discussed below, and is comparable to systematic 
errors in the SDSS spectroscopic metallicity estimates'^^ . 
A map of the median residuals, when fitting the median 
metallicity map using eq. IH in the g — r vs. u — g plane is 
shown in the bottom left panel in Figured It illustrates 
that there is no strong correlation between systematic 
errors in photometric metallicity computed with eq. |4] 
(<0.1 dex) and the u — g and g — r colors. 

We compute photometric metallicity estimates for all 
59,789 stars in the sample using eq.[4l and compare these 
to the spectroscopic metallicity determinations. The rms 
scatter of metallicity residuals is 0.24 dex (determined 
from the interquartile range), and the distribution of 
residuals is only slightly non-Gaussian (97% of the sam- 
ple is contained within a ±3(t range). 

The rms scatter of the metallicity residuals depends 
on both the apparent magnitude and color of the star 
under consideration. The bottom right panel in Figure [2] 
illustrates the color dependence: for low-metallicity stars 
the rms increases to <0.3 dex, while it is about 0.15 dex 
or less for high-metallicity stars. This is expected, due 
to the weaker spectral lines in low-metallicity stars (e.g., 
Du et al. 2004; Keller et al. 2007). 

The rms scatter of metallicity residuals increases with 
the g-band magnitude from 0.18 dex for g <17, to 0.25 
dex at g = 18, and 0.45 dex at g = 19.5. The ran- 
dom metallicity errors are dominated by the errors in 
the M-band magnitudes. The ratio of this scatter to 
the scatter expected due to photometric errors (which 
is readily computed from eq. |4]) is 1.7, and is nearly in- 
dependent of magnitude. The implied random errors in 
spectroscopic and photometric metallicity estimates are 
thus comparable, and have similar signal-to-noise prop- 
erties. In particular, we estimate that random errors 
in spectroscopic metallicity estimates increase from 0.15 
dex for g < 17 to 0.36 dex at g = 19.5 (for comparison, 
the corresponding values for photometric metallicity es- 
timates are 0.10 dex and 0.30 dex, respecitvely). This 
seemingly surprising result, that the estimated errors for 
photometric metallicity are smaller than those obtained 

^^ The systematic errors are much larger for stars with log(g)<3: 
for example, for stars with 0.4 < g — r < 0.6 and log(g)=2.5, the 
photometric metallicity estimate is 0.5 dex larger than the spec- 
troscopic metallicity (when the systematic error vs. log(g) trend 
of about 0.35 dex per dex is corrected for, the rms scatter of the 
metallicity residuals for log(g)<3.5 is ~ 0.3 dex). 
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'^ The mean r — i color on the main stellar locus for 
the g — r color listed in first column, evaluated using 
eq.[2]'' The absolute magnitude in the g band, evaluated 
for a fiducial [Fe/H] = —1.0 using eq. [T]'^ The distance 
for a star with g = 19.5 (kpc).'' The Johnson B — V 
color, computed for convenience from SDSS photometry 
using transformations from Ivezic et al. (2006a). ° The 
absolute magnitude in the Johnson V band, computed 
from Ala- 



for spectroscopic metallicity estimates, despite the for- 
mer being calibrated off the latter, is due to the aver- 
aging of many spectroscopic estimates in a given small 
color-color bin when calibrating photometric metallicity, 
and the fact that the signal for photometric metallicity 
estimates predominantly comes from wavelengths shorter 
than 0.4 /zm, while for spectroscopic metallicity estimates 
are obtained from longer wavelengths. 

This error behavior limits the application of photomet- 
ric metallicity estimates, based on SDSS data, to about 
g < 19.5. This limit is essentially set by the precision of 
the u-band photometry (u < 20.5). Somewhat coinciden- 
tally^'^, this is about the same limiting depth as for the 
SDSS spectroscopic sample (the spectroscopic targeting 
limit for the SEGUE survey is g < 20) . Despite this lim- 
itation, the photometric metallicity estimator given by 
eq. m is a valuable tool, because it allows metallicity to 
be determined for all main sequence SDSS stars in the 
0.2 < g — r < 0.6 color range. For example, in SDSS 
DR6, out of ^--^5.7 million point sources from this color 
range that are brighter than g = 19.5, SDSS spectra 
classified as stars are available only for ^94,000 objects. 
This implies a sample size increase by about factor of 60 
when using photometric metallicity estimates. Further- 
more, when deeper data are available, the photometric 
metallicity estimator can be used to study the metallic- 
ity distribution in the Galaxy to distances beyond the 
reach of main sequence stars in the spectroscopic sample 
(a small number of giants in the spectroscopic sample, 
which reach to distances ~100 kpc, cannot be easily rec- 
ognized using photometry alone; however, see Helmi et 
al. 2003). Further details about the behavior of the pho- 
tometric metallicity estimator are discussed in Appendix 
C. 

3. ANALYSIS OF THE STELLAR PHOTOMETRIC 
METALLICITY ESTIMATES 



^•^ The similar depths are not entirely independent, as they both 
reflect the atmospheric and sky properties, and various scientific 
tradeoffs, but this discussion is beyond the scope of this paper. 



We now use the photometric metanicity estimator de- 
veloped above to study the steUar metaUicity distribu- 
tion as a function of position in the Galaxy and stel- 
lar kinematics. We consider stars in a restricted color 
range, 0.2 < g ~ r < 0.4, because the redder stars 
(0.4 < g — r < 0.6) do not extend as far into the halo 
(due to their smaller luminosities; Table 1). The small 
color range also minimizes various selection effects that 
could be larger for a wider color/luminosity range (such 
as uncertainties in the photometric parallax relation and 
contamination by giants). As an additional motivation, 
in this color range metaUicity is nearly a function of the 
u — g color alone (eq. IC2I in Appendix C), which al- 
lows a simple assessment of the impact of photometric 
errors in the u band on derived metaUicity. The adopted 
0.2 < g — r < 0.4 color range spans about 10 MK spectral 
subtypes (from ~F5 to ~G5; Bailer- Jones et al. 1997, 
1998). The median absolute magnitude in this color 
range is Mg ~ 4.6, with an rms scatter of 0.3 mag and a 
difference of ~2.2 mag in Mg between the blue and red 
ends (for a fiducial [Fe/H] = — 1, see Table 1). 

We consider two photometric catalogs constructed us- 
ing SDSS data. A catalog of coadded repeated observa- 
tions (10 on average; Ivezic et al. 2007), known as the 
SDSS Stripe 82 catalog, provides improved photometric 
accuracy to a fainter flux limit in ^300 deg^ of sky. For 
example, while single-epoch SDSS data deliver a median 
u — g error of 0.06 mag at g = 19.5 (for point sources 
with 0.2 < g ~ r < 0.4), the same level of accuracy is 
extended to beyond g = 20.5 in the coadded catalog. 
This allows us to study the metaUicity distribution all 
the way to the disk-halo interface, at several kpc from 
the Galactic plane, with small metaUicity errors. At the 
bright end, the random errors in the u — g error are 0.01 
mag in the coadded catalog, and 0.025 mag in single- 
epoch data (an error in the u — g color of 0.02 mag in- 
duces a metaUicity error in [Fe/H] that varies from 0.02 
dex at [Fe/H] = -0.5 to 0.11 dex at [Fe/H] = -1.5). 
This improvement in photometric metaUicity accuracy 
by more than a factor of two enables robust estimates of 
the metaUicity distribution width for disk stars. How- 
ever, an important disadvantage of using the coadded 
catalog is its very small sky coverage. Hence, we extend 
our study to a significant fraction of the Galaxy by us- 
ing a wide-area catalog based on SDSS Data Release 6 
(DR6 catalog, hereafter). This catalog covers an area 
~30 times larger than the deep coadded catalog, at the 
expense of a '--^l mag shallower sample. 

We begin our analysis with a discussion of the stellar 
distribution in the g vs. u — g color-magnitude diagram, 
which reveals several features that are central to the con- 
clusions of this paper. While this diagram maps well to 
a distance vs. metaUicity plane, as discussed in § 13.21 
further below, we choose to first describe these features 
using directly observed quantities. When discussing posi- 
tions of stars in the Milky Way, we use the usual cylindri- 
cal coordinate system {R, (p, Z) aligned with the Galactic 
center (assumed to be at a distance of 8.0 kpc) and with 
the Z axis towards the north Galactic pole. For projec- 
tions parallel to the Galactic plane, we follow J08 and 
use right-handed X and Y coordinates, with the Sun at 
X = 8.0 kpc and the positive Y axis pointing towards 
/ = 270°. 



TABLE 2 

Best-fit Parameters for the Gaussians 
Shown in Fig. |4] 



g range" 


N^ 


/.<= 


a'i 


/i error'' 


15.0-15.5 


1,087 


1.095 


0.078 


2.4 


15.5-16.0 


1,605 


1.085 


0.076 


1.9 


16.0-16.5 


1,911 


1.082 


0.070 


1.6 


16.5-17.0 


2,328 


1.078 


0.070 


1.5 


17.0-17.5 


2,590 


1.075 


0.064 


1.3 


18.0-18.5 


3,348 


0.899 


0.050 


0.9 


18.5-19.0 


3,745 


0.901 


0.055 


0.9 


19.0-19.5 


4,504 


0.895 


0.055 


0.8 


19.5-20.0 


5,893 


0.891 


0.060 


0.8 


20.0-20.5 


8,712 


0.886 


0.075 


0.8 



'^ The g magnitude range. The number of stars 
in the bin.'' The best-fit mean u — g color (only 
data with u — g > 1.0 are fit in the five bright- 
est bins, and data with u — g < 0.95 in the five 
faintest bins (see Figure |4]l.'^ The best-fit distri- 
bution width.'' The statistical error in the mean 
(millimag) . 

3.1. The Bimodal u — g Distribution of F/G Stars 

We selected 110,363 sources from ~1.01 million entries 
in the Stripe 82 coadded catalog'^''' by requiring at least 
4 detections in the u band, 0.20 < g ~ r < 0.40 and 
g < 20.5. These sources have |<5j2ooo| < 1-266° and right 
ascension in the range 20h 34m to 4h 00m. For reference. 
Galactic coordinates, {l,b), are (46,-24), (96,-60) and 
(190,-37) for aj20oo=-50°, 0° and 60° (at Sj2ooo=0°)- 

The distribution of these stars in the g vs. u — g color- 
magnitude diagram is shown in the top left panel of Fig- 
ure [3l Bright red {g < 18, u — g ~ 1.1) and faint blue 
{g > 18, u — g ~ 0.9) features are clearly discernible, 
and are roughly separated by the u — g = 1 line (corre- 
sponding to [Fe/H] ~ —1.0). The marginal u — g dis- 
tributions for three g slices are shown in the bottom left 
panel. They can be approximately described by a sum of 
two ~0.07 mag wide Gaussians centered on u — g=0.90 
and 1.07, with the number ratio of the blue to red com- 
ponent increasing with magnitude from 1:7 in the blue 
bin to 20:1 in the red bin. The blue and red compo- 
nents correspond to distant metal-poor halo stars, and 
more metal-rich and closer disk stars, respectively, as 
discussed further below. The width of 0.07 mag is suf- 
ficiently larger than the median error in the u — g color 
(0.05 mag at 5=20.5, 0.04 at g=20, 0.02 at 5=19 and 0.01 
at 5=17.5) to provide a robust measure of the intrinsic 
width of the u — g distribution. 

In addition to an overall blueing of the median u — g 
color towards the faint end induced by the varying num- 
ber ratio of the two components, the median u — g color 
for each component also becomes bluer, as illustrated 
in Figure [4] and summarized in Table 2. We measure 
gradients of —0.012 mag/mag for the disk component^^ 
(14.5 < 5 < 17) and —0.006 mag/mag for the halo com- 
ponent (18 < 5 < 20.5), with statistical errors of ^0.002 
mag/mag. Using an approximate mapping from mag- 
nitude to distance, these color differences could be pro- 



http: 



•^^ This catalog is publicly available from 
//www. sdss.org/dr6/products/value_added/index. html. 

•^^ This gradient was accounted for in the definition of the s color 
by Ivezic et al. (2004), but its meaning was not understood at that 
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Fig. 3. — The top left panel shows the distribution of stars (logarithm of counts in each bin; low to high is blue to green to red) from the 
SDSS Stripe 82 catalo g wi th 0.2 < g — r < 0.4 in the g vs. u — g color-magnitude diagram. In this g — r range, the u — g color is a proxy 
for metallicity (see ea. lC2l in Appendix C). The two concentrations of stars correspond to disk {u — g ^ 1.1) and halo {u — g ^ 0.9) stars, 
with the dashed lines indicating the change of the median u — g color with magnitude for each concentration. The u — g color distributions 
in three magnitude slices, marked by vertical bars in the top left panel, are shown in the bottom left panel. All three histograms can be 
approximately described by a sum of two ~0.07 mag wide Gaussians centered on it — g=0.90 and 1.08, with the number ratio of blue to 
red component increasing with magnitude from 1:7 to 20:1. For detailed fits to the u — g color distribution as a function of magnitude, see 
Figure |4] The top right panel is analogous to the top left panel, except that a complete imaging sample of stars is replaced by stars from 
the SDSS spectroscopic survey. The spectroscopic sample is highly incomplete, as evident from the patchy distribution. The bottom right 
panel shows the same sample of stars from the spectroscopic survey, with the u — g color replaced by spectroscopic metallicity. 

duced by a gradient of roughly 0.02 mag/kpc between \Z\ 
— 1 kpc and 2.5 kpc for disk stars, and 0.003 mag/kpc 
between \Z\ — 4 kpc and 10 kpc for halo stars. Hence, 
the color gradient per kpc is about 7 times larger for disk 
stars. 

The detected color gradient cannot be caused by po- 
tential errors in the applied corrections for interstellar 
extinction (Schlegel, Finkbeiner & Davis, 1998). The 
median value of the u — g reddening correction is only 
0.05 mag, and even the closest stars (at 500 pc) are well 
beyond the ~100 pc thick dust layer (JOS). Such a gra- 



dient ('--'0.05 mag between u = 14 and u = 19) could be 
caused by a non-linearity in the u-band measurements 
(based on a comparison with independently measured 
Stetson standard stars, this effect is ruled out for the 
g-band measurements; Ivezic et al. 2007). However, a it- 
band non-linearity at the 0.05 mag level is excluded by in 
situ measurements of the hardware response curve, and 
a comparison of reductions of SDSS data using several 
different pipelines (SExtractor, DAOPhot, and DoPhot; 
Becker et al. 2007) excludes such a large software error 
in the SDSS photometric pipeline. We proceed with an 
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Fig. 4. — The symbols with error bars show the u — g color 
distributions in 0.5 mag wide magnitude slices (see the top left 
panel in Figure[3ll, in the range g = 18 — 20.5 (left) and g = 14.5 — 
17.0 (right). The bottom histograms correspond to the brightest 
bin, and each histogram is successively offset by 2 for clarity. The 
solid lines show the best-fit Gaussians, fit to data with u — g< 0.95 
(left) and n — g > 1.0 (right), to minimize contamination by disk 
and halo stars, respectively. The best-fit parameters are listed in 
Table 2. The dashed black lines are the same as the solid black 
lines, and are added to illustrate the shift of histograms towards 
bluer colors for the faint bins. The gradients of the u — g color with 
respect to the g magnitude are —0.006 ±0.002 mag/mag (left) and 
-0.012 ± 0.002 mag/mag (right). 

assumption that this gradient is not a problem in the 
SDSS data. 

For selected stars with 0.2 < g — r < 0.4, the u — g color 
measures metallicity (see Appendix C), and the observed 
color scatter and color gradients correspond to the metal- 
licity distribution width and metallicity gradients. Be- 
cause the imaging sample is defined by a simple flux limit, 
these measurements are relatively easy to interpret. On 
the contrary, the SDSS spectroscopic sample has an ex- 
tremely biased distribution (by design) in the g vs. u — g 
color-magnitude diagram, as illustrated in the two right 
panels of Figure[3l it would not be easy to derive a robust 
selection function. Using eqs.[l]and|4l we find that the 
color gradients measured for the imaging sample roughly 
correspond to a ^--^0.06 dex/kpc metallicity gradient for 
disk stars at \Z\ ~1.5 kpc, and a ~0.01 dex/kpc for halo 
stars in the — Z — =4-10 kpc range (given the distance 
and sky coordinates, the three-dimensional position in 
the Galaxy can be trivially computed). In the remainder 
of this section we remap the g vs. u — g color-magnitude 
diagram to a distance-metallicity diagram, discuss the 
metallicity distribution as a function of the position in 
the Galaxy, develop a model that captures the data be- 
havior, and correlate the metallicity with the observed 
stelar kinematics. 

3.2. The Bimodal Metallicity Distribution of Thick 
Disk and Halo Stars 

Despite its small area, the Stripe 82 catalog covers a 
substantial range of R and \Z\, as shown in the top left 
panel in Figure [5] As evident from the dependence of 
the median metallicity on R and \Z\, the \Z\ gradient is 
much larger than the radial gradient (by about a factor of 
10). Given this large difference in metallicity gradients, 
we proceed by making the assumption that the metal- 
licity distribution is a function of the Z coordinate only 
(in S 13.51 below, we critically examine and justify this as- 
sumption using the DR6 catalog). To further minimize 
the effect of any radial gradient, we constrain the sample 
to -34,000 stars with 7 < i?/kpc < 9. 

The top right panel of Figure[5]shows the resulting con- 
ditional metallicity probability distribution for a given Z , 



p{[Fe/H]\Z). This distribution is computed as metallic- 
ity histograms in narrow Z slices, and normalized by the 
total number of stars in a given slice. Apart from renor- 
malization and the applied 7 < i?/kpc < 9 selection, 
this is essentially an upside-down warped version of the 
g vs. u — g color-magnitude diagram shown in the top 
left panel of Figure [31 The bright red and faint blue com- 
ponents from Figure [3] are now readily identifiable as the 
relatively close metal-rich disk component, and the more 
distant metal-poor halo component, respectively. 

3.3. A Simple Model for the Conditional Metallicity 
Probability Distribution 

As is evident from the p{[Fe/H]\Z) map shown in the 
top right panel of Figure [5l the Z gradient of the median 
metallicity map shown in the top left panel of Figure [5] 
is due to the varying contributions of a metal-rich disk 
component and a metal-poor halo component. We first 
attempt to model the p{[Fe/H]\Z) map using two Gaus- 
sian components with a Z-dependent ratio of their area 
(components' number ratio) 

pix = [Fe/H]\Z) = (1 - fn) G{x\fiD,^D) (5) 

+ fH G{x\hh,'7h), 



where 



G{x\^l,a) 



2-Ka 



(6) 



The distribution width for both components can be mod- 
eled as spatially invariant, ajj = 0.16 dex, and uh — 0.30 
dex, as is the case for the median halo metallicity, 
Ph = —1.46. The median and dispersion for metallic- 
ity distribution of halo stars is in good agreement with 
previous work (e.g., Ryan & Norris 1991). Due to the 
small errors in the u — g color for the coadded data, the 
contribution of measurement errors to gu and an is very 
small: the implied intrinsic widths are 0.16 dex and 0.29 
dex, respectively. 

Inspection of the p{\FelFl\\Z) map suggests that the 
variation of the median metallicity for the disk compo- 
nent'^^ in the 0.5 < jZj/kpc < 5 range can be described 
as 

P^d{Z) = /loo + Am exp(-|Z|/H^) dex, (7) 



with the best-fit values iJ^ = 1.0 kpc, [i^ 



-0.78 and 



Ap = 0.35. The best fit is shown by the curved dashed 
line in the top right panel in Figure [5l The exponential 
"height", iJ^, is constrained to only about 50% due to 
covariances with /Zoo and A^ (which are constrained to 
about 0.05 dex). The implied median metallicity values 
agree well with a value of —0.7 obtained by Gilmore & 
Wyse (1985) (they did not detect a metallicity gradient). 
The best-fit hd{Z) given by eq. [7] is valid only for 
\Z\ > 500 pc because of the sample bright hmit. Close 
to the plane, the mean and rms scatter of the metallic- 
ity distribution are —0.14 and 0.19 for F/G- type dwarfs 
(Nordstrom et al. 2004; AUende Prieto et al. 2004), 
and —0.12 and 0.18 for K-type giants (Girardi & Salaris 
2001), respectively. Hence, the vertical metallicity gradi- 
ent close to the plane must be larger than ~ 0.35 dex/kpc 

^^' An obvious question is whether the observed variation of the 
median metallicity for the disk component simply reflects the vary- 
ing contributions of th in- an d thick-disk stars. This question in 
addressed in detail in § 13.4.41 
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Fig. 5. — The top left panel illustrates the dependence of the median photometric metallicity for ~76,000 stars from the SDSS stripe 
82 coadded photometric catalog with 14 < g < 20.5, 0.2 < g — r < 0.4 and photometric distance in the 0.5-10 kpc range, on cylindrical 
Galactic coordinates K and Z. Note that the Z gradient is much larger than the R gradient {~0.1 dex/kpc vs. <0.01 dex/kpc). The top 
right panel shows the conditional metallicity probability distribution at a given distance from the Galactic plane for ~34,000 stars with 
0.5 < Z/kpc < 7 and 7 < i?/kpc < 9 (the probability density is shown on a logarithmic scale, with its integral normalized to 1). The 
two concentrations of stars correspond to disk (\FejH\ ~ —0.7) and halo ([_Fe/i?] ~ —1.5) stars. The difference between this map and 
a two-component Gaussian model described in § 13.31 is shown in the bottom left panel with the same dynamic range for color coding as 
used in the top right panel. The residual feature visible around [_Fe/H] ~ — 1.1 and Z ^ \ kpc can be modeled either as a third Gaussian 
component, or by adopting a non-Gaussian metallicity distribution for the disk component. The residuals map for the former is shown in 
the bottom right panel. 



implied by the extrapolation of eq. [7| (because stars on av- 
erage become more metal poor by about 0.5 dex between 
Z = and \Z\ = 1 kpc). 

To set the relative normalization of the two Gaus- 
sians, fniZ), we approximately separate halo and disk 
components by isolating stars with [Fe/H] < — 1.1 and 
[Fe/H] > —0.9, respectively. A good description of the 
data, shown by symbols in Figure [6l is provided by a 
best-fit function with three free parameters 



with a = 70, 6 = 24 pc, a nd c = 0.62. 
function further in § 13.3.41 



We discuss this 



fHiZ) = 



1 



l + aexp[-(|Z|/6)-]^ 



(8) 



3.3.1. The "metal-weak thick disk": a third Gaussian 
component or a non-Gaussian Distribution? 

The difference between the data and a two-Gaussian 
model described above is shown in the bottom left panel 
of Figure [51 As is evident, the overall behavior of the two 
dominant components is captured, but the residual map 
reveals a feature that contains intermediate-metallicity 
stars {[Fe/H] ^ —1.0) within ^^2-3 kpc from the plane. 
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2 3 4 

|Z| (kpc) 

Fig. 6. — The symbols show the number ratio of stars with 
[Fe/H] < —1 relative to those with [Fe/H\ > —1, for stars with 
7 kpc < ij < 9 kpc (with the ratio labeled on the right y axis). 
Beyond Z ~ 2.5 kpc, metal-poor stars dominate the counts. The 
solid line passing through the symbols is a three-parameter best fit 
used in modeling the conditional metallicity probability distribu- 
tion (equal to 1/(1 + 70j7), with r? = exp(-|Z/240pc|0-^2))^ Tj^^ 
short-dashed line is a prediction for the halo-to-disk counts ratio 
based on a best-fit Galaxy model to stellar counts from Juric et 
al. (2008). The model includes an o blate power-law halo, and ex- 
ponential thin and thick disks (see i) l3.3.4|l . The disk contribution 
to the counts is shown by the dotted line (with the In(counts) la- 
beled on the left y axis), and the long-dashed line shows the halo 
contribution. The sum of disk and halo contributions is shown by 
the solid line. 

This feature includes about 5% of stars in the sample and 
is reminiscent of the so-called "metal-weak thick disk" 
(Morrison, Flynn & Freeman 1990; Chiba & Beers 2000; 
Beers et al. 2002). Indeed, it can be satisfactory modeled 
as a third Gaussian component with /i = —1.0, a = 
0.10 dex and a strength of 20% relative to the metal-rich 
component, as illustrated by the smooth residual map 
shown in the bottom right panel of Figure [5l 

An alternative to postulating a third Gaussian compo- 
nent for eq.[n]is to adopt a skewed metallicity distribution 
for the disk component whose shape need not vary with 
the distance from the plane (i.e., replacing G(x\ijld,<^d) 
with a non-Gaussian distribution). A skewed shape for 
the metallicity distribution of local F/G dwarfs was also 
measured by Gilmore, Wyse & Jones (1995), but with 
an overall offset of ^--^0.5 dex towards higher metallicity, 
as would be expected compared to our data at \Z\ = 1 
kpc. 

A quantitative validation of such a universal shape for 
the disk metallicity distribution is shown in Figure [71 In 
each Z slice that shows evidence for the disk component, 
the shape of its skewed metallicity distribution can be 
modeled^^ as a sum of two Gaussians with fixed positions 
relative to each other (offset of 0.14 dex), fixed widths 
(0.21 dex for the more metal-poor component, and 0.11 
dex for the other), and fixed relative normalization (1.7- 
to-1 number ratio in favor of the more metal-poor compo- 
nent). The values of these four parameters were obtained 

^'' We attempted to fit this skewed distribution using a log- 
normal distribution, but the detailed shape could not be repro- 
duced. 



by simultaneously fitting the four histograms shown in 
Figure [7l with the position of the disk distribution fixed 
at values computed from eq.[7](the difference between the 
median given by eq. [7] and the mean for the more metal- 
poor disk Gaussian is 0.067 dex). The halo parameter 
fn was initially estimated using eq.[8l while fin = —1-46 
and an = 0.30 were kept constant. A few minor ad- 
justments to these three parameters, listed in Table 3, 
improved the fits. The most notable change is a shift of 
the halo metallicity distribution by ±0.06 dex for Z > 3 
kpc. 

The best-fit values of fn from Table 3 are consistent 
with eq. m By retaining that function, and adopting the 
above Z-independent two-Gaussian shape description for 
the disk component, we obtain a residual map that is 
indistinguishable from that obtained using a third Gaus- 
sian component. Hence, we conclude that either hypoth- 
esis can explain the data. While both are formally based 
on three Gaussian components, the "universal shape" hy- 
pothesis demonstrates that the data do not require the 
second disk Gaussian to be independent of the first. 

It is tempting to identify the two Gaussians that de- 
scribe the disk component with the thin- and thick- 
disk contributions. However, the fits presented above 
are inconsistent with this interpretation. The double- 
exponential best-fit to observed spatial profile (with scale 
heights of 245 pc and 743 pc, and a relative normaliza- 
tion of 0.13, see § 13. 3. 41 and Figure[B]) imply that the ratio 
of thick-to-thin disk stars should strongly vary from 1.9 
intheZ = 0.8-1.2 kpc bin to 14 in the Z = 1.5-2.0 kpc 
bin, and > 1000 in the Z = 3.0 - 4.0 kpc bin. Yet, the 
metallicity distributions admit a fit based on a constant 
normalization ratio for the two Gaussian components. 
Of course, this fitting success alone does not necessar- 
ily prove that traditional decomposition into two fixed 
distributions with a varying normalization ratio is incon- 
sistent with the data. We return to the problem of distin- 
guishing these two hypotheses, which have very different 
implications for galaxy formation and evolution theories, 
when discussing correlations with kinematics further be- 
low (§[3X4l). 

3.3.2. The Effects of Systematic Errors on the Photometric 
Parallax Relation 

Various systematic errors in the metallicity and dis- 
tance estimates affect the best-fit models for the metal- 
licity distribution described above. The main sources of 
systematic errors in the adopted photometric parallax re- 
lation are binarity effects (age effects can contribute at 
most 0.2 mag systematic uncertainty in absolute magni- 
tude at the median color of the sample, g — r = 0.3, 
and ~0.1 mag at g — r — 0.4; see Appendix A). As 
discussed in detail by JOS, binarity effects are expected 
to decrease the inferred distances and disk exponential 
scale height by about 15%. The impact of binarity on 
the metallicity determination is hard to estimate with- 
out detailed knowledge of both the incidence of binaries 
and their mass ratio distributions. By performing simu- 
lations similar to those described by JOS, we find that the 
worst-case scenario is a system consisting of a luminous 
low-metallicity primary with u~ g — 0.8 and g — r = 0.2, 
and a secondary with u — g ^ 1.0 and g — r — 0.5 (the 
redder secondaries are too faint to have a larger impact). 
When such a system is (mis) interpreted as a single star, 
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Fig. 7. — The symbols with error bars show the measured photometric metallicity distribution for stars with 0.2 < g — r < 0.4, 7 kpc 
< -R < 9 kpc, and distance from the Galactic plane in the range 0.8-1.2 kpc (top left, ~6,200 stars), 1.5-2.0 kpc (top right, ~3,800 stars), 
3.0—4.0 kpc (bottom left, '~2,800 stars) and 5.0—7.0 kpc (bottom right, ~6,000 stars). The histograms are essentially horizontal (parallel to 
X axis) slices at corresponding \Z\ intervals through the map shown in the top right panel of Figure [3] The dashed magenta lines show a 
best-fit two-component, halo plus disk, model. The blue lines show the halo contribution, modeled as a single Gaussian, and th e red lines 
show the contribution of a non-Gaussian disk model, modeled as a sum of two Gaussians shown by the green lines. See S IS. 3. H and Table 
3 for the best-fit parameters. 



\Z\ range"^ 



TABLE 3 
Best-fit Parameters for p{[Fe/H]\Z), Shown in Fig. [7] 



(1^1)' 



fd 

JH 



J H 



M?f 






0.8-1.2 


0.98 


6,187 


0.08 


0.14 


0.09 


-1.46 


0.30 


1.5-2.0 


1.72 


3,842 


0.24 


0.30 


0.26 


-1.46 


0.30 


3.0-4.0 


3.47 


2,792 


0.71 


0.73 


0.73 


-1.40 


0.30 


5.0-7.0 


5.79 


6,025 


0.97 


0.95 


0.95 


-1.52 


0.32 



'^ The \Z\ range for each bin (kpc).'' The median \Z\ in each bin (kpc).'^ The number of stars in 
the corresponding bin."^ The halo-to-disk number ratio predicted by the JOS best-fit model (see 
ii l3.3.4ll .° The halo-to-disk number ratio predicted by eq.[8]' The best-fit halo-to-disk number ratio. 
The halo parameters are only weakly constrained in the first bin.^ The best-fit mean metallicity 
for the halo component.'' The best-fit distribution width for the halo component. 
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the distance is underestimated by 15%, the effective tem- 
perature is underestimated by ^240 K, and the metaUic- 
ity is overestimated by ~0.2 dex. For components that 
have mass ratios closer to unity (as suggested by, e.g., 
Reid et al. 2002), the bias in metaUicity will be much 
smaller, while the bias in distance estimates can increase 
up to -40%. 

3.3.3. The Edge of the Thick Disk at \Z\ ~ 5 kpc? 

Using photographic data for 250 stars, Majewski 
(1992) advocated an "edge" of the Galactic thick disk 
at about 5.5 kpc above the Galactic plane. Indeed, the 
map of conditional metaUicity distribution shown in the 
top right panel of Figure [5] suggests a tantalizing pos- 
sibility that the metal-rich component docs not extend 
beyond 4-5 kpc from the plane. This visual impression 
is addressed quantitatively by the histogram shown in 
the bottom right panel of Figure [71 The extrapolation 
of the disk component to the 5-7 kpc bin predicts that 
the disk should be detectable as a factor of —2 excess 
around [Fe/H] ~ —0.8, on top of the underlying halo 
distribution. Such an excess seems consistent with the 
data, and argues against a cutoff in the distribution of 
disk stars. Due to the small sample size (~4,800 stars), 
the noise is large and the significance of this excess is not 
overwhelming. Even when the 7 < i?/kpc < 9 constraint 
is removed, the 5 < |Z|/kpc < 7 subsample is still too 
small (^^8,600 stars) to significantly overcome counting 
noise. 

We conclude that Stripe 82 catalog is not sufficiently 
large to convincingly demonstrate the lack of an edge in 
the distribution of disk stars. We improve the statisti- 
cal power of this analysis by using a ~30 times larger 
sky coverage provided by the DR6 catalog, as discussed 
below. 

3.3.4. A comparison with results from June et al. (2008) 

The expression given by eq. [S] is only a convenient fit 
that involves a small number of free parameters. The 
halo-to-disk number ratio, fn, depends on the disk and 
halo spatial profiles, both of which are usually modeled 
using numerous free parameters. As discussed by J08, a 
unique solution is not possible without a large sky cov- 
erage, and even in such a case it is difficult to obtain. 
Nevertheless, we can test whether the data for fn dis- 
cussed here are consistent with the best-fit spatial profiles 
obtained by J08, which do not incorporate metaUicity in- 
formation. 

J08 show that the stellar number density distribution, 
p(-R, Z, (f)) can be well described (apart from local over- 
densities) as a sum of two cylindrically symmetric com- 
ponents 

p{R,Z,(j})^ Pd{R.Z)+ ph{R,Z). (9) 

The disk component can be modeled as a sum of two 
exponential disks 

pi3(i?,Z)=p,,(i?o,0)[e-l^+^el/^i-(«-«<=)/^i (10) 
j^^jje-\z+Ze\/H2~(ii.-Re)/L2^^ 

while the halo component requires an oblate power-law 
model 

/ ^2 \ "if/2 

Ph{R, Z) = PDiRQ,0) en ^, , ®, „ . (11) 



The best-fit parameters are discussed in detail by J08. 
We have adopted the following values for parameters rel- 
evant in this work (second column in Table 10 from J08): 
Zq=25 pc. Hi = 245 pc, ifa = 743 pc, eo = 0.13, 
en = 0.0051, qn = 0.64, and uh = 2.77. 
The fraction of halo stars implied by this model. 



fH{R,z) 



Ph{R,Z) 



PD{R,Z)+pHiR,zy 



(12) 



i?2 + (Z/qny 



agrees reasonably well with fn determined here (see Fig- 
ure [ni without any modification. Given that J08 deter- 
mined their best-fit parameters using counts at Z > 10 
kpc and that the adopted photometric parallax relations 
are somewhat different, this agreement is remarkable. 
The short-dashed line representing the J08 model in Fig- 
ure [S] can be brought into essentially perfect agreement 
with the data points by decreasing Hi and H2 by 4% of 
their values (i.e., by much less than their uncertainty, 
20%, quoted by J08). Alternatively, data points can 
be moved closer to the JOS model by increasing halo 
normalization by 24%, to en = 0.0063 (~ la change). 
Hence, the results presented here validate the best-fit 
model from JG8. 

The best-fit model for stellar counts provides the useful 
guidance that the counts of thin- and thick-disk stars 
become equal around j^j ~ 1 kpc, and that the counts 
of disk and halo stars become equal around \Z\ ~ 2.5 kpc. 
At the bright end of the sample (500 pc), thin-disk stars 
contribute ~70% of stars to the disk component (with a 
halo contribution of ~1%); at the faint end (5 kpc), halo 
stars contribute ~90% of the sample. Hence, the SDSS 
imaging data are well suited for studying the metaUicity 
distribution all the way from thin disk to halo, through 
the thick-disk transition, using an essentially complete 
Uux-limited sample of numerous main-sequence stars. 

3.4. Analysis of the Large Area SDSS Data Release 6 

Sample 

SDSS Data Release 6 provides photometry for ^300 
million objects detected over ~10,000 sq.deg. of sky. 
Using selection criteria listed in § 12.3.11 and addition- 
ally requiring 0.2 < g — r < 0.4 and distances in the 
0.5-12 kpc range, we selected 2.53 million stars (the ex- 
tended 0.2 < g — r < 0.6 color range includes 5.7 million 
stars). The significantly increased sky coverage comes 
at the expense of photometric metaUicity precision: for 
metal-rich disk stars random errors increase from 0.05 
dex at a distance of 1 kpc to 0.12 dex at 4 kpc, and for 
metal-poor halo stars they increase from 0.20 dex at 4 
kpc to 0.36 dex at 7 kpc (about 3 times as large as for 
the coadded data discussed above). 

The median metaUicity map shown in Figure [5] is anal- 
ogous to that shown in Figure [5] except for the signif- 
icantly larger sky coverage. The conclusion about the 
vertical metaUicity gradient being much stronger than 
the radial gradient remains valid. The strong Z gradient 
is a result of the low-metallicity halo component becom- 
ing dominant at Z beyond ^3 kpc. The only deviation 
from this overall trend is seen in the region associated 
with the Monoceros Stream (see § 13.5.11 for a detailed 
discussion) . 

An analogous map constructed using only stars with 
[Fe/H] > —0.9 is shown in the top left panel of Figure[9l 
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Fig. 8. — The dependence of the median photometric metallicity for ~2.5 million stars from SDSS Data Release 6 with 14.5 < r < 20, 
0.2 < g — r < 0.4, and photometric distance in the 0.8-9 kpc range, in cylindrical Galactic coordinates R and \Z\ (compare to the top left 
panel of Figure[5ll. There are ~40,000 pixels (50 pc X 50 pc) contained in this map, with a minimum of 5 stars per pixel and a median of 
33 stars. Note that the gradient of the median metallicity is essentially parallel to the \Z\ axis, except in the Monoceros stream region, as 
marked. 

While it also displays a much stronger gradient in Z di- 
rection, a local maximum with an amplitude of ^0.1 dex 
is discernible at i? ^ 10 kpc and at ^--^2.5-3.5 kpc above 
the plane. The X — Y median metallicity map for this Z 
slice, shown in the top right panel of Figure[9l places this 
maximum at X ~ 10 kpc and Y ~ —2 kpc. We address 
such localized metallicity inhomogeneities further below 
(§|31D. 

The conditional metallicity distribution for DR6 stars 
within from a solar cylinder having a radius of 1 kpc 
is shown in the bottom left panel of Figure [D As evi- 
dent, there is a close resemblance between the conditional 
metallicity distribution map constructed using the Stripe 
82 catalog (Figure O and the map based on the DR6 
catalog. The best-fit parameters listed in Table 3 that 
describe the histograms shown in Figure [7] are consistent 



with the DR6 map, when the increased measurement er- 
rors are taken into account (disk and halo components 
are convolved with 0.10 dex and 0.20 dex wide Gaussians, 
respectively). 

3.4.1. The Edge of the Thick Disk Revisited 

The large number of stars in the DR6 sample enables 
a statistically robust analysis of the suspected cutoff in 
the distribution of thick-disk stars around |Z| ~ 5 kpc. 
Figure fTOl shows the metallicity distribution of ^^295, 000 
stars with 3 < i?/kpc < 12 and 5 < Z/kpc < 7. When 
a best-fit Gaussian is subtracted from the observed dis- 
tribution, a highly significant residual at [Fe/H] > — 1 
remains. This residual feature contains about 8% of the 
stars (^24,000, so counting noise is not an issue) and sug- 
gests that, even so far away from the plane, stars with 
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Fig. 9. — The top left panel is analogous to Figure |8] except that only 1.1 million stars with [Fe/H] > —0.9 are used to compute the 
median [Fe/H] (and the display stretch is harder, as indicated in the panel). Note a coherent feature at _R ~ 10 kpc and Z ~ 3 kpc. Its 
extent parallel to the Galactic plane is shown in the top right panel, which shows the median metallicity for stars with 2.5 < Z/kpc < 3.5 
(note the shifted color map). The bottom left panel shows the conditional metallicity probability distribution for '^300,000 stars from a 
cylinder perpendicular to the Galactic plane, centered on the Sun, and with a radius of 1 kpc. The dashed lines are the same fiducials 
as shown in the top right panel of Figure [5] The bottom right panel shows the median vy velocity component (heliocentric; the value of 
~220 km/s corresponds to no rotation) as a function of metallicity and distance from the Galactic plane for ~41,000 stars with Z < 7 
kpc and b > 80° (see the dashed lines in the top left panel). Note the coherent feature with slightly larger (fy) (by ~ 20-30 km/s) at 
[Fe/H] 0.6 and Z ~ 6.5 kpc. 



[Fe/H] > —0.7 arc dominated by disk stars (the disk-to- 
halo number ratio is about 2:1). The value of 8% is in 
excellent agreement with the model discussed in § 13.3.4! 
(8.8% at Z = 6 kpc). An alternative interpretation is 
that halo metallicity distribution is not Gaussian, but 
skewed towards high [Fe/H] > — 1 values, with a disk 
cutoff at [Z[ < 5 kpc. Given the remarkable agreement 
with a Gaussian for [Fe/H] < —1.1, it seems more likely 
that disk is indeed traceable to beyond 5 kpc from the 
plane. 

The small peak in the observed metallicity distribution 
visible around [Fe/H] = —0.5 is probably an artifact of 



the photometric metallicity estimator. As discussed in 
§ [2XT] and § [2X3l stars with [Fe/H] > -0.5 may be 
biased towards somewhat lower metallicity values, which 
may explain the observed (minor) effect. 

3.4.2. The Metallicity-Kinematics Maps for Stars around 
the North Galactic Pole 

Kinematic measurements can offer additional insight 
into Galactic components revealed by the metallicity dis- 
tribution. The sky coverage of the DR6 catalog includes 
the anticenter {I ~ 180°, AC hereafter) and north Galac- 
tic pole (NGP hereafter) regions, where proper motion 
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Fig. 10. — The black histogram shows the photometric metal- 
licity distribution for ~295,000 stars with 3 < ij/kpc < 12 and 
5 < |Z|/kpc < 7 (compare to the bottom right panel in Figure O. 
The dashed red line is a Gaussian with a width of 0.41 dex (imply- 
ing an intrinsic width of 0.32 dex), centered on [Fe/H] = —1.45, 
and an area of 0.92. The difference between the observed distribu- 
tion and this Gaussian is shown by the blue line, and presumably 
corresponds to disk stars. The small peak around [Fe/H] = —0.5 is 
likely an artifact of the photometric metallicity estimation method. 

measurements provide a robust constraint on the rota- 
tional velocity component even without knowledge of ra- 
dial velocity. We take proper motion measurements from 
the Munn et al. (2004) catalog based on astrometric 
measurements from SDSS and a collection of Schmidt 
photographic surveys. The proper motion measurements 
publicly available as a part of SDSS DR6 are known to 
have significant systematic errors (Munn et al., in prep.). 
Here we use a revised set of proper motion measurements 
which will become publicly available as a part of SDSS 
DR7. 

Despite the sizable random and systematic astrometric 
errors in the Schmidt surveys, the combination of a long 
baseline (^--^50 years for POSS-I survey), and a recalibra- 
tion of the photographic data using positions of SDSS 
galaxies, results in median random errors for proper mo- 
tions of only ~ 3 mas yr~^ (per component) for g < 19.5. 
Systematic errors are typically an order of magnitude 
smaller, as robustly determined by BOS using ~80,000 
spectroscopically confirmed SDSS quasars from Schnei- 
der et al. (2007). At a distance of 1 kpc, a random error 
of 3 mas yr"-'^ corresponds to a velocity error of ~15 
km/s, which is comparable to the radial velocity accu- 
racy delivered by the SDSS stellar spectroscopic survey. 
At a distance of 7 kpc, a random error of 3 mas yr~^ 
corresponds to a velocity error of 100 km/s, which still 
represents a usable measurement for large samples, given 
that the systematic errors are much smaller (~10 km/s 
at a distance of 7 kpc). The faint limit of this cata- 
log {g ~ 20) is well matched to the depth of the SDSS 
photometric metallicity sample, and thus proper motion 
measurements are available for more than 90% of the 
0.2 < g - r < 0.4 sample. The kinematics of the SDSS 
stellar sample, including the mutual consistency of kine- 
matics based on radial velocity and proper motion mea- 
surements, are discussed in detail by BOS. Here we briefly 
present a few results that are directly related to the con- 
clusions of this paper. 

For stars towards the Galactic poles, the proper motion 
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measures the radial and rotational velocity components. 
We select 55,429 unresolved sources closer than 10 kpc 
with 0.2 < g - r < 0.4 and b > 80° from SDSS DR6. We 
use the vx and vy velocity notation to emphasize the 
spatially constrained nature of this sample, and to make 
a distinction from velocity components computed using 
a measured radial velocity; in these directions, vx '^ 
vji and vy ~ «*■ We do not correct velocities for the 
solar motion relative to the Local Standard of Rest (LSR; 



"X 



-10.0 ± 0.4 km/s, V- 



-5.3 ± 0.6 km/s; and 



w| = 7.2±0.4kni/s; Dehnen & Binney 199S). Therefore, 
the mean value of vy for a non-rotating population is 
vy = -D® -H v^^'^ - 225 km/s, where Vy^^ ~ 220 km/s 
is the rotational velocity of the local standard of rest 
(Gunn, Knapp & Tremaine 1979). 

The median heliocentric rotational velocity component 
as a function of metallicity and distance from the Galac- 
tic plane in shown in the bottom right panel of Figure [9l 
The detailed kinematic behavior shows the same two 
halo and disk components as implied by the metallic- 
ity distribution. The high-metallicity disk component at 
[Fe/H] > — 1 and Z < 3 kpc lags the local rotation by 
up to <100 km/s, while the low-metallicity halo compo- 
nent at [Fe/H] < -1 and Z > 3 kpc has (vy) -- 200 
km/s. This strong metallicity-kinematic correlation is 
qualitatively the same as discussed in the seminal paper 
by Eggen, Lynden-Bell & Sandage (1962), except that 
here it is reproduced in situ with a '--^lOO times larger, 
nearly-complete sample, extending it beyond the solar 
neighborhood. 

The variation of the median vy as a function of dis- 
tance from the Galactic plane is shown separately for 
high-metallicity {[Fe/H] > —0.9) and low-metallicity 
{[Fe/H] < —1.1) subsamples in Figure [TTl The median 
Vy for ^40,000 low-metallicity stars is 205 km/s. The 
systematic velocity uncertainty, set by the systematic 
distance and proper motion errors, is about 10-20 km/s. 
Therefore, this measurement implies that the halo does 
not rotate at that level of accuracy, in contradiction with 
a result based on similar type of analysis (proper motions 
and photometric parallax) by Majewski (l992), that the 
halo counter-rotates with a speed of —55 ± 16 km/s rela- 
tive to the LSR (based on a sample of a few hundred stars 
towards the north Galactic pole). In order to make our 
Vy for halo stars agree with Majewski's result, the dis- 
tance scale given by photometric parallax relation needs 
to be increased by 25% (~0.5 mag offset in absolute mag- 
nitude scale) , which is unlikely (see Appendix A) . An al- 
ternative explanation that proper motion measurements 
used here are systematically overestimated by ^2 mas/yr 
is ruled out by independent data (BOS). In addition, the 
radial velocity measurements from SDSS spectroscopic 
survey also indicate no significant halo rotation at the 
level of ~10 km/s (Allende Prieto et al. 2006; Ivezic et 
al. 2006; BOS). Recall, as mentioned previously, that our 
data do not extend very far into the region that CaroUo 
et al. (2007) have argued is dominated by a proposed 
outer, counter-rotating, halo component. 

The median vy for ~1S,000 high-metallicity stars in- 
creases with Z. We obtained a best fit 

(vy) = 20.1 + 19.2 |Z/kpc|i-25 km/s. (13) 

An alternative linear fit 



(vy) = 32.2 |Z/kpc| km/s, 



(14) 
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Fig. 11. — The dependence of vy (heliocentric rotational velocity component) on distance from the plane, \Z\, for ~ 18,000 high- 
metallicity {[Fe/H] > -0.9, top left) and ~ 40,000 low-metaUicity {[Fe/H] < -1.1, top right) stars with 6 > 80°. In the top two panels 
individual stars are shown by small dots, and the medians in bins of Z are shown by large symbols. The 2a envelope around the medians 
is shown by dashed lines. The bottom two panels compare the medians (left) and dispersion (right) for the two subsamplcs shown in the 
top panels (dots for low-metaUicity and triangles for high-metallicity subsamples). The horizontal dashed line at vy=220 km/s in the 
bottom left panel is added to guide the eye. The other two dashed lines in the bottom left panel are best fits to the observed tiy(|Z|) 
for high-metallicity stars (see text). The open symbols in the bottom right panel show the median random velocity measurement errors 
(circles for low-metaUicity and triangles for high-metallicity subsamples), and the two dashed lines are best fits to the observed increase of 
velocity dispersion with \Z\. They assume that the intrinsic halo velocity dispersion is constant, (Ty=90 km/s, and that the intrinsic disk 
velocity dispersion increases as cry = 26 -f 10 |Z|/kpc (km/s). 

also provides a good description for (vy) as a function of as 

Z for stars with 0.2 < g - r < 0.4 and 1 < Z/kpc < 4. 
However, BOS show that the detailed kinematics of red 
stars {g — r > 0.6), which sample the smaUer Z range 
(<1 kpc) where the two fits differ by ^20 km/s, prefer 
the non-linear form given by eq. 1131 

The observed velocity dispersion for both halo and disk 
subsamples increases with Z (see the bottom right panel 
of Figure [TT|). This increase is dominated by increasing 
measurement errors (mostly due to the fact that even 
for constant proper motion errors, the velocity error in- 
creases proportionally to distance) . We find that the ob- 
served velocity dispersion for halo stars can be modeled 
with a constant intrinsic dispersion of cry = 90 km/s. 
For disk stars, the intrinsic dispersion increases with Z 



from ~36 km/s at Z = 1 kpc to ~66 km/s ai Z — A kpc. 
The decrease of rotational velocity with Z (sometimes 
called velocity lag) was first convincingly detected by Ma- 
jewski (1992), using photographic data for 250 stars. At 
Z ~ 1.5 — 2.0 kpc, he found a lag of 50±10 km/s, in good 
agreement with the value of 59 km/s given by eq. 1131 
Chiba & Beers (2000) measured a somewhat steeper gra- 
dient, of 30±3 km/s/kpc. Using proper motion data for 
about a million M dwarfs, Ivezic et al. (2005) reported 
essentially the same behavior of rotational velocity at 
Z < 1 kpc, with a median value of 34 km/s at, Z ^ 1 
kpc, which agrees well with the lag of 39 km/s obtained 
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here using F dwarfs. Using SDSS radial velocity data, 
Allende Prieto at al. (2006) measured a vertical rota- 
tional velocity gradient of 16 km/s, which is similar to the 
results based on proper motion measurements. Most re- 
cently, Girard et al. (2006) used data for 1200 red giants 
and detected "velocity shear" towards the south Galac- 
tic pole. The velocity gradient of 29 km/s/kpc given by 
eq.[T3]at Z = 2 kpc is consistent with their value of 30±3 
km/s/kpc. They also detected a vertical gradient in the 
rotational velocity dispersion of (10±3) km/s/kpc, in ex- 
cellent agreement with the value obtained here. Given 
the different tracers, analysis methods, and the possi- 
bility for north-south asymmetry, the results presented 
here and those from Girard et al. (2006) are remarkably 
consistent. 

3.4.3. A Model for the Rotational Velocity Distribution 

The full metallicity/velocity distribution in four Z 
slices is shown in Figure \Vl[ In agreement with the me- 
dian behavior shown in Figure [TTl close to the Galactic 
plane the sample is dominated by high-metallicity stars 
with a small velocity lag, and gradually becomes dom- 
inated by low-mctallicity stars with no net rotation at 
large Z. We find no correlation between velocity and 
metallicity for distant low-metallicity stars (see the bot- 
tom right panel in Figure [T^ . 

The marginal rotational velocity distributions for maps 
from Figure [12] are shown in Figure 1131 Analogously to 
modeling the metallicity distributions in § 13.3. H we first 
attempt to fit the dependence of rotational velocity dis- 
tribution on distance from the Galactic plane, Z, using a 
Gaussian for the halo component and a universal shape, 
modeled as two coupled Gaussians, for the disk compo- 
nent (an alternative approach based on thin/thick disk 
decomposition is described below). 

We heuristically model the shape of the disk velocity 
distribution as a sum of two Gaussians with fixed posi- 
tions relative to each other (an offset of 34 km/s), fixed 
widths (12 km/s and 34 km/s), and fixed relative normal- 
ization (3-to-l number ratio in favor of the more metal- 
poor component). Motivated by the behavior of {vy) for 
the [Fe/H] > —0.9 subsample (eq. [T^l), we describe the 
central velocity of the narrower Gaussian as 

u„(Z) = -3.0 +19.2 |Z/kpc|^-2^ km/s, (16) 

where the offset of 23 km/s is due to the difference be- 
tween the median of the adopted skewed velocity dis- 
tribution and the mean for the narrower Gaussian com- 
ponent. The values of these four free parameters were 
obtained by simultaneously fitting the four histograms 
shown in Figure 1131 The halo velocity distribution was 
kept fixed as a Gaussian centered on 200 km/s and with 
a width of 90 km/s. 

In each Z bin, the expected velocity measurement er- 
ror (determined from expected random errors in dis- 
tance and proper motion) was added in quadrature to 
the widths of all three Gaussians. For the disk compo- 
nent, we also add in quadrature 10.1 |.Z/kpc| km/s based 
on eq. [iSl The best-fit parameters are listed in Table 4 
and the best fits are shown in Figure [T31 As is evident, 
this simple model of a skewed distribution that slides 
linearly with Z provides a satisfactory description of the 
data. 



We use the best fits for the marginal metallicity distri- 
bution described in § 13.3.11 (see Figure [T]) , and the best 
fits for marginal rotational velocity distribution discussed 
above to model the observed joint distributions in the 
rotational velocity vs. metallicity plane, shown in Fig- 
ure 1121 In each Z bin, we simply multiply the best-fit 
marginal distributions and subtract their product from 
the observed map. There are two important assumptions 
underlying this approach: (1) The disk and halo compo- 
nents used to model the two marginal distributions map 
well onto each other, and (2) the velocity and metallicity 
distributions of each individual component are uncorre- 
lated. 

The above assumptions are borne out by the data. As 
an illustration of the residual and x^ maps, we show the 
Z = 3— 4 kpc bin, which contains similar fractions of disk 
and halo stars fFigure fH)) . The observed distribution is 
successfully modeled to within expected statistical noise 
(^30% for counts per pixel, on average). The observed 
lack of a correlation between velocity and metallicity dis- 
tributions for disk component is at odds with the tradi- 
tional thin/thick disk decomposition, which we address 
next. 

3.4.4. Difficulties with the Thin-Thick Disk Separation? 

Our model fits above show that the data for both the 
metallicity and rotational velocity distributions of disk 
stars can be fit with shapes that do not vary with Z . 
While formally these skewed distributions are modeled 
as sums of two Gaussians, these Gaussians cannot be 
readily identified with the classical thin and thick disks. 
First, their normalization ratio is constant, while double- 
exponential best-fits to the observed spatial profile pre- 
dict that the fraction of stars in each component should 
strongly vary with Z . Secondly, the normalization ra- 
tios are different, 1.7 for the metallicity distributions and 
3.0 for the rotational velocity distributions, respectively, 
which implies that the two components do not perfectly 
map onto each other. 

Despite these difficulties, it is instructive to attempt a 
traditional thin- plus thick-disk decomposition using the 
data presented here. We seek metallicity and velocity dis- 
tributions whose linear combination, with weights given 
by the observed spatial profiles, reproduces the varia- 
tion of the observed distributions with Z . This can be 
done with only a minimal reliance on models, because 
the data span a wide range of Z, and because there are 
clearly detected Z gradients in both the metallicity and 
velocity distributions, which cannot be attributed to halo 
stars. The disk spatial profile (with the JOS best-fit pa- 
rameters) implies that the \Z\ = 2.0 — 2.5 kpc bin is 
dominated by thick-disk stars (only 2% of disk stars from 
this bin are expected to belong to the thin-disk compo- 
nent, with a 38% contribution from halo stars), while 
they contribute, for example, ~67% of all disk stars in 
\Z\ = 0.8 — 1.2 kpc bin. Therefore, one can subtract 
the ^38% halo contribution from the metallicity distri- 
bution in \Z\ = 2.0 — 2.5 kpc bin, then re-normalize the 
difference, and treat it as the thick-disk metallicity distri- 
bution. It can then be subtracted, after renormalization, 
from the disk-dominated metallicity distribution in the 
\Z\ — 0.8— 1.2 kpc bin. The rcnormalizcd difference can 
be treated as a pure thin-disk distribution. The two dis- 
tributions can then be linearly combined and compared 
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log,o[count(FeH,Vy)]. b>80: Z = 0.8-1.2 kp c Z=1.5-2 kpc 
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Fig. 12. — The distribution of stars with 0.2 < g — r < 0.4 observed towards the north Galactic pole (|fe| > 80°) in the velocity-metallicity 
diagrams, and as a function of distance from the plane in the range 0.8-1.2 kpc (top left, ~1,500 stars), 1.5-2.0 kpc (top right, ~4,100 
stars), 3.0-4.0 kpc (bottom left, '^6,400 stars) and 5.0-7.0 kpc (bottom right, '^12,500 stars). The maps show the logarithm of counts 
in each pixel, according to the legend shown in the top right panel. Towards the north Galactic pole, the plotted heliocentric velocity 
component (vy) corresponds to the rotational component. Its median value for subsamples selected by [Fe/H] > — 1 in the top right panel 
is 59 km/s and 117 km/s in the bottom left panel. The median value for subsam ples selected by [Fe/H] < —1 is 192 km/s and 203 km/s 
in the bottom two panels, respectively. For marginal vy distributions, see Figure [13] The horizontal lines at vy = and vy = 220 km/s 
are added to guide the eye. The symbols in the bottom right panel show the median values of metallicity (squares) and vy (triangles) in 
narrow bins of the other coordinate. 



to distributions observed in the intermediate bins. 

The top left panel of Figure [T5l shows the result of this 
comparison for the \Z\ = 1.2 — 1.4 kpc bin. The metal- 
licity distributions for both components appear non- 
Gaussian, with a difference of their means of about 0.2 
dex. Their expected linear combination reveals discrep- 
ancies with the data, but they are not overwhelming, 
and could be due to uncertainties in the decomposition 
procedure itself. 

A similar linear decomposition method cannot be ap- 
plied to the kinematic data analyzed here because the 
velocity measurement errors increase too much over the 
relevant range of Z. Instead, we choose to model the ob- 



served velocity distributions using two Gaussians, with 
fixed mean velocities and dispersions. The widths are 
convolved with known, Z-dependent, velocity measure- 
ment errors when fitting the four free parameters. We 
normalize the two Gaussians assuming their relative con- 
tributions (and halo normalization) predicted by the den- 
sity (counts) profiles shown in Figure [6l 

The best fits are shown in Figure [T5l for three represen- 
tative Z bins, with the best-fit parameters listed in Table 
5. The presumed thick-disk Gaussian has a velocity lag 
of 48 km/s, relative to the first Gaussian (centered at 57 
km/s and 9 km/s, respectively). The implied intrinsic 
velocity dispersions are 18 km/s and 28 km/s. The fits 
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Fig. 13. — The symbols with error bars show the measured rotational velocity distribution for stars with 0.2 < g — r < 0.4, b > 80°, and 
the distance from the Galactic plane in the range in the range 0.8-1.2 kpc (top left, ~1,500 stars), 1.5—2.0 kpc (top right, ^4,100 stars), 
3.0—4.0 kpc (bottom left, ~6, 400 stars) and 5.0—7.0 kpc (bottom right, '~12,500 stars). These histograms are the marginal vy distributions 
for the maps shown in Figure [12] The red and green curves show the contribution of a non-Gaussian disk model (a sum of two Gaussians 
with fix ed, 1:3, relative normalization), the blue curves show halo contribution (a Gaussian), and the magenta curves are their sum (see 
§ 13.4.31 for details and Table 4 for best-fit parameters) . 



TABLE 4 
Best-fit Parameters for p(vy), Shown in Fig. [13] 



\Z\ range" 


(l^l)" 


N'^ 


(9)' 


«di 


4. 


-^1 


-'i2 


fk 


< 


-5 


-i 


0.8-1.2 


1.08 


1,526 


15.2 


18 


57 


23 


39 


0.12 


92 


14 


17 


1.5-2.0 


1.75 


4,076 


16.2 


36 


70 


34 


46 


0.22 


94 


23 


26 


3.0-4.0 


3.49 


6,445 


17.8 


88 


122 


65 


72 


0.68 


104 


48 


52 


5.0-7.0 


5.99 


12,452 


18.9 


176 


211 


118 


122 


0.95 


123 


93 


84 



^ The \Z\ range for the bin (kpc).'' The median \Z\ in the bin (kpc).'^ The number of stars in the 
bin."^ The median g-band magnitude in the bin.'' The mean velocity for the first Gaussian disk com- 
ponent, in km/s, computed using ea. ll6l ^ The mean velocity for the second Gaussian disk component, 
^I[i2 = ^dl + 34, (km/s).^ The velocity dispersion for the first Gaussian disk component (12 km/s) 
convolved with measurement errors (km/s).'^ The velocity dispersion for the second Gaussian disk 
component (34 km/s) convolved with measurement errors (km/s).' The best-fit halo-to-disk number 
ratio.-' The velocity dispersion for the halo component (90 km/s) convolved with measurement errors 
(km/s).'' The median velocity measurement error for stars with [Fe/H] > —0.9 (km/s).' The median 
velocity measurement error for stars with [Fe/H] < — 1.1 (km/s). 



22 



tWTA for h>SO: Z=i-A kpc 

I 1 1 I 



I J I 1 I p I I 1 I I I, I I I I 



I L I ■ I I I I I I P I I 1 I I I I' I i I , 




^H"^^ I I I! I I I I I I ' J V I I I I I I I I 



^ 1 I !^W 
-0.5 0,5 




(DATA-MODEL)/HOISE 



Is 

■'if 

8- 



I I I » I I ^ I I I I J I ■ I ■ I I I ■ I , 







■ > ■ 



- ' - ■ - 



' 



fre/H) 



!re/H) 



Fig. 14. — The top left panel shows distribution of stars (logarithm of counts in each bin) observed towards the north Gal acti c pole 
(|b| > 80°), and with 3.0 < |Z|/kpc < 4 kpc, in the velocity-inetallicity diagram (same map as in the bottom left panel in Figure [T2l l. The 
top right panel shows a best-fit model assuming that velocity and metallicity distributions are uncorrelated, when disk and halo components 
are treated separately, and are given by the best fits shown in the bottom left panel of Figure [7] for the metallicity distribution, and in the 
bottom left panel of Figure [131 for the velocity distribution. The bottom left panel shows the logarithm of the data/model ratio, displayed 
with the same dynamic range as the top two panels. The observed counts are predicted with an rms scatter of 33%. This scatter is 
consistent with the expected statistical noise. The bottom right panel shows the data-model difference map normalized by the expected 
noise. The rms width of the distribution is 1.09. 
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Fig. 15. — The top left panel illustrates a test of the hypothesis that the observed variation of metallicity distribution with \Z\ is due 
to a varying mixing ratio of the independent thin and thick disk metallicity distributions. The latter are determined using metallicity 
distributions in the \Z\ = 1.0 — 1.2 kpc and \Z\ = 2.0 — 2.5 kpc bins, and are shown as red (thin disk) and green (thick disk) lines. They 
are scaled using the 4.6:1 thick-to-thin disk number ratio expected for the displayed \Z\ = 1.2 — 1.4 kpc bin from the counts profile. The 
data for this \Z\ bin are shown by symbols with error bars. The blue line shows halo contribution (15%), and the magenta line is the sum 
of all three components. The remaining three panels show measured rotational velocity distributions in three Z slices (symbols with error 
bars; 1.0-1.2 kpc: top left; 1.2-1.4 kpc: bottom right; 1.4-1.6 kpc: bottom right). The blue curves are Gaussians corresponding to halo 
stars, with the parameters listed in Table 5. The red lines are Gaussians centered on 9 km/s, and the green lines are Gaussians centered on 
57 km/s, with widths listed in Table 5. The increase of their widths with |Z| is consistent with estimated measurement errors; the implied 
intrinsic widths are 18 km/s for narrow Gaussians, and 28 km/s for wide Gaussians. These curves are normalized according to thick-to-thin 
number ratio and halo contribution listed in Table 5. 



TABLE 5 
Best-fit Parameters for p{vy), Shown in Fig. 1151 



\Z\ range" 


(1^1)" 


N'^ 


(9)" 


-Si 


'^L 


-'h 


fh 
Jh 


' DD 


1.0-1.2 


1.11 


1,142 


15.3 


23 


32 


80 


11 


2.7 


1.2-1.4 


1.39 


1,480 


15.6 


25 


33 


81 


15 


4.6 


1.4-1.6 


1.50 


1,603 


15.9 


27 


34 


81 


19 


7.9 



^ The \Z\ range for the bin (kpc). The median \Z\ in the bin (kpc)."^ The number of stars in the 
bin.'^ The median g-band magnitude in the bin.'' The velocity dispersion for the first Gaussian 
disk component convolved with measurement errors (km/s). The velocity dispersion for the second 
Gaussian disk component convolved with measurement errors (km/s).s The velocity dispersion for 
the halo component convolved with measurement errors(km/s).'^ The fraction of halo stars from 
J08 model (%).' The thick-to-thin disk star number ratio from JOS model (see ij 13. 3. 41 1. 
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Fig. 16. — A comparison of data for 1,142 stars with fe > 80 and 1.0 < |Z|/kpc < 1.2 kpc, and a model based on a traditional disk 
decomposition into thin and thick components. The model assumes three Gaussians with contributions of thin disk, thick disk and halo 
stars equal to 28%, 52%, and 20%, respectively. The Gaussians describing the metallicity distribution (top left panel, symbols represent 
data and histogram represents model) are centered on [Fe/H\ = -0.50,-0.72 and —1.37, and have widths of 0.04, 0.15, and 0.32, dex, 
respectively. The Gaussians describing the velocity distribution (top right panel) are centered on 9, 57 and 205 km/s, and have widths of 
25, 33, and 80 km/s, respectively. The two bottom panels show the two-dimensional distributions in the velocity/metallicity diagram for 
the model (left) and the data (right), with individual stars shown as small dots. The large dots show the median velocity in 0.1 dex wide 
metallicity bins, for stars with velocity in the —60 km/s to 120 km/s range. The 2-cr envelope around these medians is shown by the two 
outer dashed lines. The dashed lines in the middle show the best linear fit to the median velocity, with slopes of —91 km/s/dex (model) 
and —4.1 km/s/dex (data). 
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Fig. 17. — Analogous to the bottom right panel of Fig |16l except that SDSS spectroscopic sample of stars is used (vy from Fig |16l 
corresponds to t)$+225 km/s). Its all-sky distribution and radial velocity measurement enable the sampling of regions closer to the Galactic 
plane than with the b > 80° sample (left: 285 stars with |Z|=600-800 pc; right: 583 stars with |Z|=800-1000 pc; both with 0.2 < g-r < 0.4). 
Although the metallicity distribution of spectroscopic sample is highly biased (see the top right panel in Fig. [Sj, the dependence of the 
median rotational velocity for narrow metallicity bins, as a function of metallicity, is not strongly affected. Note the absence of a strong 
v^ vs. \Z\ gradient (<20 km/s/dex). 



are not as good as those shown m Figure [131 but the dis- 
crepancies are not too large. They are, however, formally 
highly statistically significant, due to the large number 
of stars, but there is always a possibility for hidden sys- 
tematic errors. If the thick-disk Gaussian were replaced 
by a non-Gaussian velocity distribution, it is likely that 
the most egregious discrepancies around uy ~ 150 kni/s 
could be resolved. 

Therefore, the marginal metallicity and velocity distri- 
butions do not strongly rule out the thin/thick disk lin- 
ear combination hypothesis, but only if both disks have 
non-Gaussian metallicity and velocity distributions. The 
choice for both metallicity and rotational velocity distri- 
butions is then between a linear combination of two fixed 
non-Gaussian distributions, or a single non-Gaussian dis- 
tribution that slides with Z. A difficulty with the former 
hypothesis is that the implied thin-disk metallicity dis- 
tribution has a mean of about —0.55, which is too metal 
poor by about 0.4 dex, compared to local measurements 
(Girardi & Salaris 2001; Nordstrom et al. 2004). On the 
other hand, there is a worrisome possibility of a metallic- 
ity "compression" discussed in Appendix C, which could 
bias thin disk measurements to lower values. 

While fitting the marginal metallicity and kinematic 
distributions separately does not strongly discriminate 
among models, the thin/thick disk linear combination 
hypothesis makes a very strong prediction about the 
joint two-dimensional distribution. Because the individ- 
ual components are offset by ~0.2 dex in metallicity, and 
by ~50 km/s in rotational velocity, this hypothesis pre- 
dicts a correlation between metallicity and rotational ve- 
locity for samples selected from narrow Z slices in the Z 
range where neither component dominates. To compute 
the strength of this correlation, we generated a Monte 
Carlo sample for Z = 1.0 — 1.2 kpc bin, where the correla- 
tion should be strong, with the same size as the observed 
sample (the expected thick-to-thin disk star number ratio 
is 2.7). The distributions of the data and model stars in 
the velocity vs. metallicity plane are shown in Figure [TBI 

Even without any computation, it is evident from the 
lower left panel of Figure [TBI that the model distributions 



are strongly correlated: the 28% contribution of thin- 
disk stars, which have smaller mean velocity and higher 
metallicity than thick-disk stars, induce a net metallicity- 
velocity correlation (~ —90 km/s/dex). The value of 
Kendall's rank correlation coefficient (e.g., see Lupton 
1993) for the model distribution is -0.30 ± 0.04, which 
is significantly inconsistent with the observed value of 
0.017± 0.018 (we limit velocities to 120 km/s, and metal- 
licity to [Fe/H] > —1, to exclude halo stars). 

In order to test whether this observed lack of corre- 
lation is localized to the north Galactic pole, we have 
also evaluated the Kendall's rank correlation coefficient 
for stars with Z = 1.0 — 1.2 kpc in three 10 degree wide 
patches along the / == 180° line and with 6=50°, 30°, 
and —50°. None of these patches exhibits a statistically 
significant correlation between velocity and metallicity, 
with the value of the correlation coefficient towards the 
north Galactic pole remaining the most negative. In or- 
der to test whether the measured correlation coefficient 
depends on the selection of the Z bin, we evaluated it 
for seven 200 pc thick bins in the range Z— 0.5-2.1 kpc, 
and found values statistically consistent with the value 
for the Z = 1.0 - 1.25 kpc bin. 

For another test, one that is less sensitive to errors 
in the adopted metallicity scale, we compared velocity 
histograms for stars with —1 < [Fe/H] < —0.8 and 
—0.6 < [Fe/H] < —0.4. The observed median rota- 
tion velocities differ by 1 km/s, while the model values 
differ by 25 km/s, or eight times more than the expected 
statistical noise. To explain the observed flat median 
rotation velocity vs. metallicity behavior as due to er- 
rors in adopted absolute magnitudes, adopted Mr for 
stars with — 1 < [Fe/H] < —0.8 would have to be too 
faint by ~ 1 mag, and ~ 1.5 mag too bright for stars 
with —0.6 < [Fe/H] < —0.4. Such large errors are not 
consistent with the constraints on photometric parallax 
relation discussed in Appendix A, nor with plausible age 
effects. 

These tests would be statistically even stronger if the 
samples extended to at least Z = 0.75 kpc, where the 
expected fractions of thin and thick disk stars are equal. 
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This is not possible with the b > 80° sample due to 
image saturation (at r ^ 14), while for stars observed at 
lower Galactic latitudes, the radial velocity measurement 
is required to compute the rotational velocity component. 
Hence, we use the SDSS spectroscopic survey to extend 
the sample to Z < 1 kpc. In addition, this sample also 
critically tests a possibility that these thin/thick disk sep- 
aration difficulties are caused by photometric metallicity 
problems. 

Figure [T7] shows the u$ vs. [Fe/H] velocity diagrams 
for two samples of stars with SDSS spectra that have 
Z=600-800 pc and Z=800-1000 pc. Here, v^ is the ro- 
tational velocity component corrected for the LSR and 
peculiar solar motion (towards the NGP, w$ ~ t;y-225 
km/s). Although the metallicity distribution of the SDSS 
spectroscopic sample is highly biased (see the top right 
panel in Figure [3]), the dependence of the median rota- 
tional velocity for narrow metallicity bins, as a function 
of metallicity, is not strongly affected. These two sub- 
samples are also inconsistent with the strong velocity- 
metallicity correlation expected from traditional thin- 
thick decomposition. 

We conclude that the absence of a correlation between 
the observed velocity and metallicity distributions for disk 
stars represents a major problem for the interpretation of 
vertical velocity and metallicity gradients as being due to 
a varying linear combination of two fixed distributions. 

Our conclusion does not contradict previous work on 
this subject, because the older samples did not include 
simultaneous distance, velocity, and metallicity measure- 
ments for a sufficient number of stars with the appro- 
priate distance distribution. A test based on a ran- 
domly selected ten-times-smaller subsample, with pho- 
tometric errors and proper motion errors increased by 
a factor of two, resulted in an inconclusive difference in 
Kendall's correlation coefficients between the "data" and 
a thin/thick disk model. 

It is noteworthy that Norris (1987) proposed a Galaxy 
model which does not assume that thin and thick disks 
are discrete components, but instead form a kinemati- 
cal and chemical continuum. Stars traditionally associ- 
ated with the thick disk belong to an "extended" disk (in 
terms of spatial distribution) in Norris' terminology, and 
represent an extreme tail of metallicity and kinematic 
distributions. Our results appear roughly consistent with 
Norris' proposal, and provide a quantitative support for 
such "continuous" view of the disk over the entire rele- 
vant range of distances from the Galactic plane. 

3.4.5. The "Metal-weak Thick Disk" Revisited 

In order to test whether the third Gaussian compo- 
nent discussed in § 13.3.11 has the same kinematics as 
the rest of disk stars, we compare the vy histograms for 
-1.1 < [Fe/H] < -0.8 and -0.6 < [Fe/H] < -0.5 sub- 
samples in two Z bins: 0.5-1 kpc and 1.5-2.0 kpc. We 
find no statistically significant differences, with an upper 
limit on the relative velocity offset of ~15 km/s. This 
kinematic similarity supports the view that stars with 
— 1.1 < [Fe/H] < —0.8 reflect a non-Gaussian metal- 
licity distribution of disk stars, rather than a separate 
entity (Beers et al. 2002). Further insight will be ob- 
tained from the Rockosi et al. (in prep.) discussion of 
the properties of the metal-weak thick disk as revealed 
by the spectroscopic SEGUE sample. 



3.5. Spatially Localized Deviations from the Mean 
Metallicity Distribution 

The rich metallicity and kinematic data presented here 
enable more powerful searching for Milky Way substruc- 
ture than possible using the stellar number density maps 
alone (e.g., see JOS). We utilize the large sky coverage of 
the DR6 catalog to quantify spatial deviations from the 
conditional metallicity distribution discussed in ii l3.2l and 
13.31 We first constructed 51 maps, such as that shown 
in the top right panel of Figure [5l for regions defined by 
a 1 kpc by 1 kpc square footprint in the X — Y plane. 
In each map, we compute the median metallicity in two 
Z slices that are dominated by disk (1-2 kpc) and halo 
(5-7 kpc) stars. The range of cylindrical radius, R, is 
6-14 kpc for the disk slice, and 5-15 kpc for the halo 
slice. 

In the range 6 < i?/kpc < 10, the median disk metal- 
licity is [Fe/H] ~ —0.72, with an rms scatter of 0.05 dex, 
and a median distribution width of 0.27 dex. There is 
no discernible radial metallicity gradient, with an upper 
limit of 0.01 dex/kpc. For i? > 10 kpc, the median metal- 
hcity is [Fe/H] = -0.80, with the shift of 0.08 dex proba- 
bly due to stars from the Monoceros stream, as discussed 
below. The median halo metallicity is [Fe/H] — —1.40, 
with an rms scatter of 0.03 dex, and the median distri- 
bution width of 0.41 dex. There is no discernible ra- 
dial metallicity gradient, with an upper limit of 0.005 
dex/kpc. 

The mean expected statistical scatter in the medians 
is 0.005 dex for the disk and 0.011 for the halo (with a 
wider distribution and fewer stars for the latter), suggest- 
ing that the variation of the median halo metallicity is 
probably insignificant, while the rms variation of the me- 
dian disk metallicity of 0.05 dex appears real. The photo- 
metric calibration errors in individual SDSS runs, which 
could produce a metallicity scatter of a similar magni- 
tude, are averaged out because many runs contribute to 
each map. Furthermore, such an instrumental effect is 
ruled out by the fact that the u-band calibration errors 
would have to have an rms of 0.02 mag to produce the 
disk median metallicity rms of 0.05 dex, and only 0.006 
mag to produce the halo rms of 0.03 dex. Hence, were 
the disk rms due to calibration errors, the halo rms would 
have been 0.1 dex, and not 0.03 dex. 

As an additional method to search for localized sub- 
structure, we subtracted a best-fit model from each map 
(such as those described in § I3.3P , and visually inspected 
residual maps. The only strong feature found in residual 
maps was localized at Y ~0, Z ~ 3 — 4 kpc and i? ~ 15 
kpc, and represents an excess of [Fe/H] ^ — 1 stars. It is 
clearly visible in the median metallicity RZ map and, as 
an especially striking feature, in a conditional metallic- 
ity distribution map shown in Figure 1181 Using its spa- 
tial position, we identify this feature as the Monoceros 
stream^^, discovered in SDSS data using stellar counts 
by Newberg et al. (2002). 

3.5.1. The Metallicity Distribution for the Monoceros 

Stream 

^^ Immediately following its discovery, it was not clear whether 
the Monoceros stellar overdensity was a ring, stream or due to disk 
flaring. Subsequent work has demonstrated its stream-like profile, 
see, e.g., maps in JOS. 
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Fig. 18. — The top left panel illustrates the dependence of the median photometric metallicity for ~1.04 million stars from SDSS Data 
Release 6 with 14.5 < r < 20, 0.2 < g — r < 0.4 and |y| < 1 kpc, in cylindrical Galactic coordinates R and \Z\. This Y range is selected to 
include the Monoceros stream, which represents an overdensity by a factor of ~1.5-2 in a region around R ~ 15 kpc and \Z\ ~3-4 kpc. As 
discernible from the map, this region has a larger median metallicity than expected for this \Z\ range from smaller R. The top right panel 
shows the conditional metallicity probability distribution for a subsample of ~111,000 stars with 3 < |Z|/kpc < 4. The strong overdensity 
at R > 12 kpc is the Monoceros stream. The bottom panels show the metallicity distribution (symbols with error bars) for a subsample of 
~40,000 stars with 6 < ij/kpc < 9 (left) and for ~12,000 stars with 13 < i?/kpc < 16. The curves have the same meaning as in Figure [71 
with the addition of the cyan curve in the right panel. This curve is a 0.22 dex wid e Gaus sian centered on [Fe/H] = —0.95, and accounts 
for 33% stars in the sample that presumably belong to the Monoceros stream (see ii l3.5.1l for details). 



The conditional metallicity map from Figure [TH] 
demonstrates that regions with i? < 12 are not strongly 
affected by the Monoceros stream. We compare the 
metallicity distributions for stars with 6 < i?/kpc < 9 
(control sample), and for stars with 13 < i?/kpc < 16, 
in the two bottom panels in Figure [THl The metallic- 
ity distribution of the control sample is consistent with 
the halo and disk metallicity distributions described in 
ii l3.3.1[ with a few minor adjustments: the disk distribu- 
tion is shifted by 0.07 dex, and the halo distribution by 
0.02 dex, towards higher metallicity, the fraction of halo 
stars is changed from 61% to 55%, and 0.16 dex is added 
in quadrature to the widths of the three Gaussians to ac- 
count for the increased metallicity errors in single-epoch 
DR6 data. 

The subsample containing the Monoceros stream can 



be described using the same function as for the control 
sample, and an additional 0.22 dex wide Gaussian com- 
ponent centered on [Fe/H] — —0.95, and with a relative 
normalization of 33%. When corrected for measurement 
errors, the implied width of the metallicity distribution 
for the Monoceros stream is 0.15 dex. The best-fit nor- 
malization is in good agreement with spatial profiles from 
JOS, which suggests that the Monoceros stream is about 
a factor of two overdensity over the local background 
counts (i.e., a relative normalization of 50%). 

3.5.2. The Kinematics of the Monoceros Stream 

We select a subsample of '--^l 1,000 stars that maxi- 
mizes the fraction of stars from the Monoceros stream, 
and allows an estimate of rotational velocity using only 
longitudinal proper motion, by requiring \Y\ < 1 kpc. 
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Fig. 19. — The top left panel shows the distribution of ~7,200 stars (logarithm of counts in each bin) with |y| < 1 kpc, 3 < |Z|/kpc < 4, 
9 < R/kpc < 12 and 170 ° < / < 190° , in the longitudinal velocity vs. photometric metallicity plane (a slice through the map shown in the 
top right panel of Figure [Tsj l. The longitudinal velocity, vi, is based on proper motion measurements, and at selected I ~ 180° corresponds 
to the rotational component (heliocentric). The top right panel is analogous, except that ~12,000 stars with 170° < Z < 190° are selected 
from the 13 < R/hpc < 16 radial range, which maximizes the fraction of stars from the Monoceros stream (clustered around [Fe/H] ~ — 1 
and vi ~ 25 km/s. The difference of these two maps is shown in the bottom left panel. The bottom right panel shows the median latitudinal 
velocity (vf,) for stars in the latter (Monoceros) subsample (±100 km/s stretch, green corresponds to km/s). The analogous map for the 
9 < ij/kpc < 12 subsample appears similar. 



3 < Z/kpc < 4, 13 < i?/kpc < 16 and 170° < I < 190°. 
The distribution of these stars in the rotational velocity 
vs. metallicity diagram is shown in the top right panel 
of Figure [T9l It is discernible that the Monoceros stream 
has kinematics more similar to disk stars than to halo 
stars. We obtain a more accurate assessment with the 
aid of an analogous map for a control sample selected 
using similar criteria, except that 9 < i?/kpc < 12 (top 
left panel) . The difference of these two maps is shown in 
the bottom left panel. 

The excess stars are centered on [Fe/H] ~ —1, as ex- 
pected from the best-fit described above. Their veloc- 
ity distribution shows a strong peak at vi ^ vy ^ —50 



km/s, with a long tail towards more positive velocities. 
This residual map indicates that most of the Monoceros 
stream stars move in the direction of LSR rotation with 
velocities of up to ^270 km/s. This result is qualitatively 
in agreement with Penarrubia et al. (2005), who ruled 
out retrograde motions using models and proper motion 
measurements . 

The latitudinal velocity, Vb, based on the latitudinal 
proper motion component, is a linear combination of the 
radial and vertical velocity components. The median lat- 
itudinal velocity of stars from the Monoceros stream re- 
gion in the rotational velocity vs. metallicity plane is 
shown in the bottom right panel of Figure (THl There 
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is no significant offset from (< 20 km/s) in parts of 
the diagram wfiere the excess of the Monoceros stream 
stars is the highest. This presumably indicates that con- 
tributions from the radial and vertical motion for the 
Monoceros stream stars cancel (for disk stars, the me- 
dians for both components should be 0). Together with 
the radial velocity measurements obtained by Conn et al. 
(2005) and Martin et al. (2006), this information can be 
used to further refine models, such as those described by 
Penarrubia et al. (2005). 

4. DISCUSSION AND CONCLUSIONS 

The spectroscopic stellar parameters for over 60,000 F 
and G dwarfs, computed by the SEGUE Stellar Param- 
eters Pipeline (Beers et al. 2006; AUcnde Prieto et al. 
2006, 2007; Lee et al. 2007ab) using the SDSS spectro- 
scopic database, allowed us to derive photometric esti- 
mators for effective temperature and metallicity in the 
SDSS photometric system. The availability of the SDSS 
imaging survey, with its accurate ugr photometry, then 
enabled an unbiased volume-limited study of the stellar 
metallicity distribution to a distance limit of ^^10 kpc. 

4.1. Photometric Estimates for Effective Temperature 
and Metallicity 

The photometric metallicity estimator based on the 
SDSS u ^ g and g — r colors is reminiscent of the tradi- 
tional 5{U — i?)o.6 method based on the UBV photom- 
etry. It reflects the same physics, and can be applied to 
F and G type main sequence stars (0.2 < g — r < 0.6). 
For the SDSS single-epoch main survey data, it provides 
metallicity accurate to 0.2 dex or better for stars brighter 
than about g — 19.5. In this magnitude and color 
range, the photometric effective temperature estimator 
reproduces spectroscopic temperature with a root-mean- 
square scatter of only ^100 K. The accuracy'^^ of 100 K 
for effective temperature, and 0.2 dex or better for metal- 
licity, are comparable to parameter accuracy achieved 
using artificial neural networks with spectroscopic obser- 
vations (Snider et al. 2001), and the estimated accuracy 
of parameters determined from SDSS spectra (Beers et 
al. 2007). It is plausible that increased photometric ac- 
curacy would further improve these values. For exam- 
ple, photometry accurate to 1% may enable MK spectral 
type determination with errors smaller than one subtype. 
Derived mapping from color space to metallicity implies 
that, at least formally, an error in the u — g color of 0.01 
mag induces a metallicity error that varies from 0.01 dex 
at [Fe/H] = -0.5 to 0.06 dex at [Fe/H] = -1.5. This 
is as good a performance as obtained by the Stromgren 
uvhyP narrow-band photometric system which was opti- 
mized for this purpose (e.g., Stromgren 1966; Nordstrom 
et al. 2004). In other words, the increase of metallicity 
errors due to increased bandpass width can be compen- 
sated for by an improved photometric accuracy. 

We apply these methods to a photometric catalog of 
coadded SDSS observations from the so-called Stripe 82 
(Ivezic et al. 2007). These deeper and photometrically 
exceedingly precise coadded data allowed us to measure 

^^ Here, precision may be a more appropriate terminology than 
accuracy because the photometric estimates are tied to SDSS spec- 
troscopic values rather than to an absolute metallicity scale. That 
is, all systematic errors in SDSS spectroscopic parameters are in- 
herited by photometric estimators. 



an unbiased metallicity distribution for a volume lim,- 
ited sample of ^^200, 000 F and G type stars in the 0.5-8 
kpc distance range. We also study the metallicity dis- 
tribution using a shallower, but much larger, sample of 
several million stars in 8500 sq. deg. of sky provided by 
SDSS Data Release 6. The large sky coverage tests the 
conclusions derived using the relatively small Stripe 82 
sample and enables detection of coherent substructures 
in the kinematics-metallicity space, such as the Mono- 
ceros stream. 

4.2. The Milky Way Structure and Multi- dimensional 
Stellar Counts 

From an observer's point of view, the ultimate goal of 
Milky Way studies is to measure and describe the distri- 
bution (counts) of stars in the space spanned by apparent 
brightness, colors in multiple bandpasses'*" , two proper 
motion components (/ij, fib), radial velocity (vrad), and 
position on the sky. Specialized to SDSS data, we seek to 
understand the 8-dimensional probability density func- 
tion, p{g, u—g,g—r,fii,fii,,Vrad, I, b)- This function could 
be described empirically, without any reference to stellar 
and Galactic structure, but in practice measurements are 
used to place constraints on the latter. From a theorist's 
point of view, the problem of interpreting data can be 
rephrased as: given a small control volume centered on 
(i?,Z,c/.), 

• What is the distribution of stars as a function of 
luminosity^^ , L, 

• What is the metallicity distribution for a given L, 
and 

• What are the distributions of three velocity compo- 
nents, W0, vr and vz, for a given L and metallicity? 

Guided by these questions, we can write 

pig, u- g,g-r, nuHb, Vrad, I, b) = (17) 

Pi{9,9-r\l,b) 
xp2{u- g\g,g~r,l,b) 
X Psif^utJ-b, Vrad\u - g,g,g-r, I, b), 

where the three functions on the right side of eq. [17] are 
discussed in this and two companion papers: 

1. The function pi{g, g — r\l,b) describes the behavior 
of the g vs. g — r color-magnitude (Hess) diagram 
as a function of position on the sky. This behavior 
is a reflection of stellar luminosity function, $(i), 
and the dependence of stellar number density on 
spatial coordinates, p{R, Z, (p). A best-fit model for 
p(i?, Z, (p) developed by J08 is discussed in § 13.3.41 

'"' We limit this analysis to two colors, u — g and g — r, which pro- 
vide good estimates of effective temperature and metallicity for F 
and G main-sequence stars. However, an alternative is to consider 
a full spectral shape, instead of colors, which carries all the rele- 
vant information about the three main stellar parameters {T^jj, 
log{g)), [FE/H]. 

^^ At least in principle, it would be desirable to measure the 
stellar luminosity function, and other quantities, as a function of 
stellar age, but estimating age for individual stars is very difficult 
(e.g., Nordstrom et al. 2004; Jorgensen & Lindegren 2005), and 
beyond the scope of this paper. 
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2. The function P2(m—5|(7, g—r, I, b) describes the u~g 
color distribution for a given bin in the g vs. g — r 
color-magnitude diagram, and as a function of posi- 
tion on the sky. The u^g color distribution reflects 
the mctallicity distribution, p([Fe/H]\R, Z, </)), and 
p{R, Z, (f>). In this contribution, we show that, sim- 
ilarly to p{R,Z,(J3), p{[Fe/H]\R,Z,(l}) can be weU 
described (apart from local overdensities) as a sum 
of two components 



pix^[Fe/H]\R,Z,^) = 

[l-.fHiR,Z)]pDix\Z) 



(18) 



fH{R,Z)pH{x) 



where the halo-to-disk counts ratio is simply 
fniR, Z) = pHiR, Z)/[pd{R, Z) + ph{R. Z)\. 

The halo metallicity distribution, ^^([i^e/iJ]), can 
be modeled as a spatially invariant Gaussian cen- 
tered on [Fe/iJ] = —1.46 (for the Stripe 82 catalog; 
for the full DR6 sample, the median [Fe/H]=— 1.40) 
and with the intrinsic (not including measurement 
errors) width gh = 0.30 dex. For \Z\ < 10 kpc, an 
upper limit on the halo radial metallicity gradient 
is 0.005 dex/kpc. 

The metallicity distribution of the disk component 
has an rms scatter of 0.16 dex, with the median 
varying as 

Hd{Z) = -0.78 + 0.35 exp(-|Z|/1.0kpc) dex, (19) 

at \Z\ > 0.5 kpc. In the \Z\ = 1.0 - 1.5 kpc range, 
the median metallicity is consistent with the mea- 
surements by Gilmore & Wyse (1995). For |.^| < 5 
kpc, an upper limit on the disk radial metallicity 
gradient is 0.010 dex/kpc. The shape of the metal- 
licity distribution of the disk component is non- 
Gaussian and can be modeled as 

PD{x^[Fe/H]\Z)= (20) 

0.37G[x\n = a{Z) + 0.14, a = 0.11] 
+ 0.63 G[a;|^ = a(Z),cr = 0.21], 

where the position a{Z) and the median p]j(Z) are 
related via a{Z) = hd{Z) — 0.067 (unless measure- 
ment errors are very large). These results represent 
powerful new constraints for the Galaxy formation 
and chemical evolution models (e.g., Tinsley 1975; 
Pagel & Patchett 1975; Wyse & Gilmore 1995; and 
references therein). 

3. The hmciiox). p2,{pi, p,b,Vrad\u - g,g,g - r, I, b) de- 
scribes proper motion and radial velocity measure- 
ments for a given bin in the g vs. g ~ r color- 
magnitude, as a function of position on the sky, 
and as a function of the u — g color. This function 
locally reflects the behavior of the velocity ellipsoid, 
but SDSS data probe sufficiently large distances to 
detect its spatial variation, as discussed in detail by 
B08. They find that the detailed behavior of kine- 
matics can also be well described (apart from local 
overdensities) as a sum of two components, disk 
and halo, that map well to components detected 
in spatial profiles and metallicity distribution. The 
non-rotating halo component has by and large spa- 
tially uniform kinematics (in an overall sense, e.g., 
B08 discuss several kinematically coherent struc- 
tures) , while the disk kinematics are dominated by 



a vertical {Z) gradient. The mean rotational ve- 
locity and the three velocity dispersions for disk 
stars can be modeled as relatively simple functions 
of the form a + b\Z\'^, as discussed in detail by 
B08 (see also Girard et al. 2006). The shape of 
the rotational velocity distribution for disk compo- 
nent is non-Gaussian and can be modeled, in the 
\Z\ = 0.8 — 5.0 kpc range (and R ^ 8 kpc), as 



Pd{x^v^\Z) = 

0.25G[x\fi ^ Vn{Z),(7 ^ 12] 
+ 0.75 G[x\^ = Vn{Z) + 34, cr = 34], 



(21) 



where 



Vn{Z) = -3 + 19.2 l^/kpcji-^'^ km/s. 



(22) 



We reiterate that the widths listed for 
p]j([Fe/ H]\Z) and pD(v<i>\Z) are intrinsic widths 
and have to he convolved with measurement errors 
when comparing to observations. The listed widths 
are measured with a relative accuracy of ~10%. 

These functions provide a good description of the over- 
all features in the distribution of stars in the spatial- 
kinematic-metallicity space, as observed by SDSS. Qual- 
itatively, these results are in fair agreement with previous 
work (e.g., Gilmore & Wyse 1985; Gilmore, Wyse & Kui- 
jken 1989; Majewski 1993; Nordstrom et al. 2004; Ghard 
et al. 2006). Quantitatively, the availability of SDSS 
data is enabling unprecedentedly powerful and robust 
studies, not only due to its large volume, but also thanks 
to its accurate and diverse measurements. In particu- 
lar, with the SDSS data, the reach of massive statistical 
studies can now be extended from < 100 pc (the HIP- 
PARCOS distance limit, e.g., Dehnen & Binney 1998; 
Nordstrom et al. 2004) to ~10 kpc. 

The results presented here are only a brief illustration 
of the great scientific potential of the SDSS stellar spec- 
troscopic database. This dataset will remain a cutting 
edge resource for a long time, because other major ongo- 
ing and upcoming stellar spectroscopic surveys are either 
shallower (e.g., RAVE, 9 < / < 12), or have a signif- 
icantly narrower wavelength coverage and depth (Gaia, 
r< 17). 

4.2.1. Is there a thick disk? 

Perhaps the most significant result of our study, in 
addition to detection of the abundant substructure in 
metallicity space, is that transition from the thin to 
the thick disk, seen (and originally defined by Gilmore 
and Reid 1983) as an abrupt change of slope in the 
log(counts) vs. Z plot around \Z\ ^ 1 kpc, can be mod- 
eled as smooth shifts of mctallicity and velocity distri- 
butions that do not change their shape. More quantita- 
tively, using the above notation, the disk metallicity and 
velocity distribution can be described as 



PD{x = v,s,OT[Fe/H]\Z) = 
ni{Z)G[x\pi{Z),(Ti] 



(23) 



n2{Z)G[x\p2{Z),(T2] 



Traditionally, the two components are interpreted as thin 
and thick disks, and ni{Z) and n2{Z) are constrained by 
stellar number counts. They are modeled as exponential 
functions with scale heights of ^300 pc and ^1000 pc, 
with pi{Z) and p2{Z), typically assumed independent 
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of Z . This description is only mildly inconsistent with 
the observed marginal metallicity and velocity distribu- 
tions. However, when the two distributions are analyzed 
simultaneously, this decomposition faces a serious diffi- 
culty. Because it combines different metallicity and ve- 
locity distributions for thin- and thick-disk components 
(the data require offsets of 0.2 dex and 48 km/s), it pre- 
dicts a strong and detectable correlation between them. 
The data presented here do not display any significant 
correlation, and rule out this prediction at a highly sig- 
nificant level (~ Str). 

We find an alternative interpretation that does not im- 
ply a strong correlation between metallicity and velocity 
distributions. Formally, we find that the data can be fit 
with ni and 712 that do not vary with Z (eas. l20l and l21|) . 
while /ii and /^2 are coupled and vary with Z according to 
eqs.[T51and[^ This ability to describe the disk metallic- 
ity and velocity distributions using functions with univer- 
sal Z-independent shapes has fundamental implications 
for its origin: instead of two distinct components, our 
data can be interpreted with a single disk, albeit with 
metallicity and velocity distributions more complex than 
a Gaussian (note that the data require non-Gaussian dis- 
tributions even in the traditional interpretation). While 
the disk separation into thin and thick components may 
still be a useful concept to describe the fairly abrupt 
change of number density around |Z| ~ 1 kpc (which is 
detected beyond doubt, see JOS for SDSS results), the 
disk spatial profile may simply indicate a complex struc- 
ture (i.e. not a single exponential function), rather than 
two distinct entities with different formation and evo- 
lution history. If this is correct, then our results imply 
that different processes led to the observed metallicity and 
velocity distributions of disk stars, rather than a single 
process, such as mergers or an increase of velocity disper- 
sion due to scattering, that simultaneously shaped both 
distributions. 

On the other hand, it appears that stars from the 
solar neighborhood, believed to be thick-disk stars be- 
cause of their kinematic behavior, have larger a-element 
abundances, at the same [Fe/ H], than do thin-disk stars 
(e.g., Fuhrmann 2004; Bensby et al. 2004; Feltzing 2006; 
Reddy et al. 2006; Ramirez et al. 2007). The thick-disk 
stars, again selected kinematically, appear older than 
the thin-disk stars (e.g., Fuhrmann 2004; Bensby et al. 
2004). Thus, it is possible that the data presented here 
are insufficient to distinguish detailed elemental and age 
differences, and that high-resolution spectroscopy is re- 
quired to do so. If such supplemental data were available, 
for example, for the ~20,000 stars analyzed in Figures [7] 
and 1131 one could determine whether the distributions 
of individual a-elements admit a universal shape, and 
whether they are correlated with kinematics. These stars 
are confined to several hundred sq.deg. of sky, with a 
sky density of ~100 deg~^, and those at \Z\ < 4 kpc 
have g < 18. Such an undertaking is within the easy 
reach of modern spectrographs installed on lOm-class 
telescopes. High-resolution studies of slightly brighter 
subsets of stars are planned to be undertaken with the 
APOGEE subsurvey, part of the proposed next extension 
of the SDSS, SDSS-m. 

4.2.2. Multidimensional Substructure 



The samples discussed here are sufficiently large to con- 
strain the global behavior of metallicity distribution and 
to search for anomalies. The halo metallicity distribution 
is remarkably uniform. The rms scatter in the median 
value for 2 kpc^ large bins of only 0.03 dex is consistent 
with expected statistical noise. The median disk metal- 
licity in 1 kpc^ bins in the Z = 1 — 2 kpc range exhibits 
a statistically significant, but still fairly small rms scat- 
ter of 0.05 dex. We detect a vertical metallicity gradient 
for disk stars (0.1-0.2 dex/kpc), but radial gradients are 
limited to <0.01 dex/kpc for both disk and halo compo- 
nents, outside of regions with strong substructure. 

The strongest overdensity identified in the metallicity 
space is the Monoceros stream. Its metallicity distribu- 
tion is distinct from those for both halo and disk, and 
has a similar width as the metallicity distribution of disk 
stars (~0.15 dex). Hence, recent discoveries of abundant 
substructure in stellar spatial distribution and kinemat- 
ics are now extended to metallicity space. We concur 
with Nordstrom et al. (2004) that "the Galaxy is a far 
more complicated and interesting subject than ever be- 
fore" . 



4.2.3. Implications for Future Imaging Surveys 

The analysis and conclusions presented here are rele- 
vant for upcoming large-scale deep optical surveys such 
as the Dark Energy Survey (Flaugher et al. 2007), Pan- 
STARRS (Kaiser 2002) and the Large Synoptic Survey 
Telescope (Tyson 2002, LSST hereafter). Of these, only 
LSST plans to obtain data in the u band^^. Over its 10- 
year long lifetime, the LSST survey will obtain about 60 
observations in the u band of a 20,000 deg^ area. Thanks 
to its large aperture, the median Scr depth of ~ 24 (for 
point sources) will be significantly fainter than for SDSS 
data (22.5), and the coadded data will reach a 5ct depth 
of u = 26. The potential of photometric metallicity es- 
timator for studying the evolution and structure of the 
Milky Way demonstrated here bodes well for LSST sci- 
ence mission. 

Using SDSS data, we estimate the number of stars 
for which LSST will provide metallicity measurements. 
Based on the discussion presented in §2, we adopt an er- 
ror in the u—g color of 0.05 mag as a practical limit for ro- 
bust metallicity studies. This color error corresponds to 
a metallicity error of 0.1 dex for metal-rich stars, and 0.2 
dex for metal-poor stars. The LSST data will achieve this 
color accuracy for stars with 0.2 < g—r < 0.6 if g < 23.5. 
This is about 4 magnitudes deeper than the analogous 
limit for SDSS survey. Based on the counts of SDSS 
stars, we estimate that LSST will measure metallicity 
accurate to 0.2 dex or better*'^ for at least 200 million 
F/G main sequence stars brighter than g — 23.5 (with- 
out accounting for the fact that stellar counts greatly 
increase close to the Galactic plane). For these stars^*, 

42 rpj^ g LSST science requirements document is available from 
|http://w ww. lsst.org/Science/lsst_baseline.shtml 

*'' At the bright end, LSST color errors will be <0.01 mag. An 
error of 0.01 mag in the g — r color corresponds to a 50 K random 
error in effective temperature, and an error of 0.01 mag in the 
u — g color corresponds to a random metallicity error of 0.01 dex 
at [Fe/H] = -0.5 and 0.05 dex at [Fe/H] = -1.5. 

*■* The 200 million stars from the "metallicity" sample will be 
observed over 250 times in the g and r bands with signal-to-noise 
ratios of about 20 or larger per observation even at the faint end 
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LSST will also provide proper motion measurements ac- 
curate to about 0.2 mas/yr aX g — 21 and 0.5 mas/yr at 
g = 23 (about 10 times more accurate and ~3 magni- 
tudes deeper than the SDSS-POSS catalog by Munn et 
al. used in this work). This data set will represent a deep 
complement to the Gaia mission {g < 20; Ferryman et 
al. 2001; Wilkinson et al. 2005), and will enable detailed 
exploration of the Milky Way halo in a six-dimensional 
space spanned by three spatial coordinates, two veloc- 
ity components and metallicity, within a distance limit 
of ~100 kpc. This study can be regarded as one of first 
steps in this mapping endeavor, which is bound to pro- 
vide unprecedented clues about the formation and evo- 
lution of our Galaxy. Indeed, "these are exciting times 
to study local galaxies" (Wyse 2006). 
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APPENDIX 
PHOTOMETRIC PARALLAX RELATION DERIVED USING GLOBULAR CLUSTERS 

In Paper I, we proposed a photometric parallax relation that did not explicitly use metallicity information, for two 
main reasons. First, the analysis included stars close to the faint limit of SDSS imaging for which the accuracy of 
photometric metallicity is significantly deteriorated due to increased w-band noise, and, secondly, the sample also 
included red stars for which metallicity is difficult to estimate. The photometric parallax relation adopted in Paper 
I implicitly takes metallicity effects into account by being somewhat shallower than a photometric parallax relation 
appropriate for a single-metallicity population: nearby stars (<1 kpc, or so), which are predominantly red (due to 
the use of a flux-limited sample), have on average high disk-like metallicites, while distant stars (^^1-10 kpc) are 
predominantly blue stars with low metallicities (at a given g — r oi g — i color, luminosity increases with metallicity for 
main-sequence stars). However, here we discuss only stars for which photometric metallicity estimates are available 
and, furthermore, they do not include very faint stars due to the flux limit (u < 21) imposed by requiring proper motion 
information. Hence, we can explicitly account for shifts of photometric parallax relation as a function of metallicity. 

The color-magnitude diagrams for globular clusters can be used to constrain the photometric parallax relation and 
its dependence on metallicity, and to estimate systematic errors using the residuals between the adopted relation and 
individual clusters. For example, using three fiducial cluster sequences, Mv{B — V), corresponding to metallicities, 
[Fe/H], of —2.20, —0.71 and -1-0.12, Beers et al. (2000) spline interpolate between them to obtain My for an arbitrary 
combination oi B — V and [Fe/H]. This is the method used to compute main-sequence distance estimates available 
from SDSS Data Release catalogs. 

There are several reasons to revisit the method developed by Beers et al. First, a transformation from Johnson 
system to SDSS system is required to apply their method to SDSS data. While this transformation is known to about 
0.01 mag (Ivezic et al. 2006a), even such a small systematic error results in an uncertainty of absolute magnitude of 
^--^0.12 mag for blue stars. Secondly, only three fiducial color-magnitude sequences are used, and it is not clear whether 
spline interpolation captures in detail the shift of the main sequence as a function of metallicity. Thirdly, the impact 
of age variations on the assumed absolute magnitudes is not quantitatively known. Furthermore, it is not known 
how similar color-magnitude sequences are for different clusters with similar metallicity. It is, therefore, desirable to 
determine photometric parallax relation using a larger number of clusters, with at least some of them observed by 
SDSS. 

We use five globular clusters observed by SDSS, selected to have distance in the range 7-12 kpc (using distances 
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TABLE 6 

The Globular Clusters Observed by SDSS and Used in the Photometric Parallax Analysis 



Name 


D" 


R*- 


[Fe/Hr„ 


[F^/mt^. 


N= 


" mill 


■ g 
5'^max 


Ar" 


M 2 


11.5 


10.0 


-1.62 


-1.66 


472 


0.40 


0.70 


0.00 


M 3 


10.4 


17.5 


-1.57 


-1.41 


1279 


0.35 


0.80 


0.03 


M 5 


7.5 


17.5 


-1.27 


-1.27 


1776 


0.40 


1.10 


-0.07 


M 13 


7.7 


15.0 


-1.54 


-1.65 


829 


0.40 


1.00 


0.06 


M 15 


10.3 


12.5 


-2.26 


-2.09 


676 


0.30 


0.70 


0.01 



^ Distance, in kpc, taken from Harris (1996).*^ Angular radius (arcmin) used for selecting cluster stars, 
taken from Simones, Newberg & Cole (2008)'^ Metallicity, taken from Harris (1996)'* Median photometric 
metallicity for stars with 0.3 < g — i < 0.5 and u < 21.5° The number of stars used for estimating [Fe/H]^/^ 
(errors are dominated by systematics)' The minimum g — i color used in the analysis (determined by turn-off 
stars)s The maximum g — i color used in the analysis (determined from r < 2 1.5)*' Th e median r-band offset 
(mag) for stars with 0.5 < g — i < 0.7, relative to a prediction based on egs. lAlHASI fusing distances listed 
in the second column). 

from Harris 1996), to constrain the shape of the photometric paraUax relation. This distance range ensures sufficient 
photometric quahty for stars in the color range g — i < 0.8 {g — r < 0.6), where photometric metallicity estimates are 
reliable. We augment this sample by data for six additional clusters compiled by VandenBerg & Clem (2003), which 
significantly increase the sampled metallicity range and allow us to determine the shift of photometric parallax relation 
as a function of metallicity. We use additional clusters observed by SDSS and by Clem, VandenBerg & Stetson (2008), 
as well as constraints based on Hipparcos and ground-based trigonometric parallax measurements, to test our adopted 
photometric parallax relation. 

Methodology and Results 

For clusters observed by SDSS, we select candidate cluster stars by limiting their angular distances from the cluster 
center to be less than the cluster radius determined by Simones, Newberg & Cole (2008). These radii, and distance 
and metallicity data from Harris (1996), are listed in Table 6. While the faint flux limits of SDSS imaging data limit 
this analysis only to relatively blue stars {g — i < 1-0), the color range where photometric metallicity can be determined 
is fully covered. 

For each cluster, we determine the median r-band magnitude in 0.05 mag wide bins of the g — i color. The red limit 
for the considered g — i range is set by requiring r < 21.5, and the blue end is selected to be at least 0.05 mag redder 
than the vertical part of the observed sequences (turn-off stars). The red limit ensures sufficient signal-to- noise ratios, 
and the blue limit is designed to minimize the evolutionary (age) effects on the shape of adopted relation. That is, 
we deliberately construct a relation that corresponds to small ages first, and then study its variation with age using 
observed and model color-magnitude sequences. The adopted g — i limits are listed in Table 6, and an example of this 
procedure (for M5) is shown in the top left panel of Figure [201 

We determine the shape of the photometric parallax relation by simultaneously fitting data for all five clusters. To 
do so, we first shift their r vs. {g — i) sequences to a uniform (arbitrary) magnitude scale by requiring that the 
median r magnitude for stars with 0.5 < g — i < 0.7 is 0. These offsets depend on the cluster metallicity, as discussed 
below. We then fit a parabola to all the data points, as a function of the g — i color, using and unweighted least 
squares method (a third-order polynomial is unnecessary to within ^0.05 mag). We used the g — i color because it has 
better signal-to-noise properties than g — r and r — i colors. We did not use the so-called "projection on stellar locus" 
technique developed in Paper I because it produces essentially identical results for relatively bright stars considered 
here. The stellar locus parametrization from Paper I can be used to express the fiducial sequence in terms of the g — r 
and r — i colors, if needed. 

The best-fit fiducial sequence is 

M°(g -i)^ -2.85 -I- 6.29 (g - i) - 2.30 {g - if, (Al) 

with M" ~ r— < r >= M^- < M^. >, valid for 0.3 < {g — i) S l-Oi ^^'^ the medians evaluated in the 0.5 < g — i < 0.7 
color range. As discernible from the cluster data shown in the top right panel of Figure [201 individual clusters follow 
the mean relation to within 0.1 mag or better (the rms scatter for all data points around the best-fit relation is 0.08 
mag). We compare the slopes of the predicted and observed sequences using the difference in absolute magnitudes at 
g — i = 0.4 and at g — i = 0.7 (the predicted value is 1.25 mag). The largest discrepancies of '--^O.l mag are observed 
for M13 (the observed sequence is steeper) and M15 (the observed sequence is shallower). These discrepancies may be 
caused by a combination of metallicity and age effects. 

We proceed by assuming that the shape of color-magnitude sequence given by eq. lAll is a universal function inde- 
pendent of metallicity, and that its norm,alization depends only on metallicity. While this is not strictly true, as we 
discuss below, the available data are not sufficient to robustly constrain the shape variation as a function of metallicity 
(and possibly other parameters, e.g. helium content; see Demarque & McClure 1980). 

We place the color-magnitude sequences for each cluster on an absolute scale using distances from Harris (1996). 
The offset of the measured globular cluster sequences relative to the best-fit fiducial sequence is a strong function of 
metallicity. We improve observational constraints on this relation by considering six additional clusters discussed by 
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Fig. 20. — The top left panel shows the color-magnitude diagram for the globular cluster M5 measured by SDSS. Individual stars are 
displayed as small dots, while the large dots show binned medians. The two dashed lines show the 2a envelope around these medians, and 
the solid line is the prediction based on the adopted photometric parallax relation (see text). The top right panel shows analogous binned 
medians for five globular clusters, with eac h se quence rescaled by the median magnitude for stars with 0.5 < g — i < 0.7. The short-dashed 
line shows a best-fit fiducial sequence (eg. IA1|| . For a comparison, the long-dashed line shows the [Fe/H] = —2.20 fiducial sequence from 
Beers et al. (2000). The dots in the bottom left panel show the absolute magnitude offsets relative to the fiducial relation for the five 
globular clusters listed in Table 6. The squares show a nalog ous offsets for the six globular cluster listed in Table 7. The short-dashed line 
is the best unweighted linear fit to both data sets (eq. IA2i . The thin long-dashed line is the AMy vs. [Fe/H] relationship from Laird, 
Carney & Latham (1988), shifted to produce the same AMr at [Fe/H] = —1.0, as the best fit derived here. The symbols with error bars 
(representing counting noise) in the bottom right panel show the distribution of differences between r-band magnitudes predicted using 
the adopted photometric parallax relation and the observed values. The histogram shows the expected scatter due to photometric errors. 

VandenBerg & Clem (2003). We used their figures to estimate for each cluster its My aX B—V = 0.60 (corresponding 
to g — i = 0.57), listed in Table 7. The corresponding M^ (i.e. the V ~ r color) are computed using the SDSS to 
Johnson system transformations from Ivezic et al. (2007). 

The data shown in the bottom left panel of Figure[20l strongly suggest a non- linear relationship (without the extended 
metallicity baseline thanks to the VandenBerg & Clem data, the five SDSS clusters would imply a linear relationship) . 
The best-fit parabola is 

^Mr{[Fe/H]) == 4.50 - 1.11 [Fe/H] - 0.18 [Fe/Hf, (A2) 



where AM^ is defined by 



Mr(g - i, [Fe/H]) = M°(.g - i) + AMr{[Fe/H]). 



(A3) 



The rms scatter around the best-fit relation is 0.05 mag for the eleven clusters used in the fit, with the maximum 
deviation of 0.08 mag. This remarkably small scatter around a smooth best-fit function suggests that the determination 
of AMr {[Fe/H]) offsets for individual clusters has a similar precision. Note, however, that the overall scale oi Mr{g — 
i, [Fe/H]) includes all systematic errors inherent in cluster distances that are adopted from Harris (1996) compilation 
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TABLE 7 
Additional Cluster Data from VandenBerg & Clem (2003) 

Name [Fe/H]"- M^ M^ 

M 92 -2.50 6.30 6.32 

M 68 -2.01 6.25 6.18 

47 Tuc -0.71 5.35 5.37 

Pleiades -0.11 4.80 4.79 

M 67 -0.04 4.75 4.72 

Hyades +0.12 4.50 4.53 

'^ Metallicity, taken from VandenBerg & Clem (2003), except for 47 Tuc, which is taken from 
Beers et al. (2000) (VandenBerg & Clem adopted [Fe/H] = —0.83, which produces a 0.1 mag 
fainter My prediction).'' The absolute y-band magnitude for B — V = 0.60, determined with an 
accuracy of 0.05-0.10 mag, from figures presented in Vanden Berg fc C lem (2003). '^ The absolute 
V-band magnitude for B — V = 0.60, determined using eqs. IA144A3I and the SDSS to Johnson 
transformations from Ivezic et al. (2006a). 

(including a possible covariance with cluster metallicity). The adopted relation produces gradients of dMj./d[Fe/ H] = 
—0.57 mag/dex at the median halo metallicity {[Fe/H] = —1.50), and —1.0 mag/dex at the median thin-disk metallicity 
{[Fe/H] — —0.2), with an offset of 1.05 mag between these two [Fe/H] values. As illustrated in the bottom left panel 
in Figure [20l the best-fit relation derived here is in excellent agreement at [Fe/H] < with an analogous relation 
proposed by Laird, Carney & Latham (1988). 

The distributions of differences between the r-band magnitudes predicted using the above expressions and the 
observed values for individual stars are consistent with expected noise due to photometric errors for all five clusters 
(see the bottom right panel of Figure [20l for an example based on M5). At the faint end (r ~ 21), the expected 
uncertainty in Mr is about 0.3 mag (random error per star), and is dominated by random photometric errors in the 
g — i color. At the bright end, the g — i errors (~0.03 mag) contribute an Mr uncertainty of ~0.15 mag, and an error 
in [Fe/H] of 0.1 dex results in Mr error of ^0.1 mag. The random errors in the g — i color and photometric metallicity 
are by and large uncorrelated because the w-band errors dominate the latter. 

The SDSS cluster data discussed here are not sufficient to extend the fiducial sequence beyond g ~ i ^ 1. While 
not required for the analysis presented here, we extend for completeness the adopted relation using the shape of the 
"bright" relation from Paper L Expressed as a function of the g — i color, 

M^{g -i) = -1.93 -t- 4.39 {g - i) - 1.73 {g ~ if + 0.452 {g - if, (A4) 

valid for (g — i) > 0.8. We test this extension further below. 

Testing 

Using SDSS observations for five clusters listed in Table 6, we first determined the median photometric metallicity 
for each cluster, using the best-fit expressions derived in this work. To avoid contamination by disk stars and noisy 
metallicity estimates, we only use stars with 0.3 < g — i < 0.5 and u < 21.5. Remarkably, the photometric metallicity 
estimates are consistent with the values taken from Harris (1996) to within '^O.l dex. This test ensures that eg. IA2I 
can also be used with photometric metallicity estimates. 

We have tested eas. lAlllXSl using an independent sample of clusters observed by SDSS at distances beyond our cutoff 
of 12 kpc (NGC 4147, NGC 5053, NGC 5466, NGC 5024 and Pal 5). The first four clusters have low metallicities 
{[Fe/H] ~ —2.0), and for Pal 5 [Fe/H] — —1.41. The r vs g — i ridge lines predicted by eq. lAll agree well with 
the observed sequences (the data are much noisier than for the first five nearer clusters due to their fainter apparent 
magnitudes). The only signficant discrepancy is observed for Pal 5, for which the predicted magnitudes are too faint 
by ~0.5 mag (using a distance of 23.2 kpc). 

To test the extension of photometric parallax relation to red colors, we use the Mv{B—V) sequence for M dwarfs with 
the Hipparcos data, as compiled in Fig. 17 from VandenBerg & Clem (2003): for B — F=(1.2, 1.3, 1.4), corresponding 
to g — i={1.51, 1.70, 1.93), we adopt My=(7.5, 8.0, 8.5). Assuming that metallicity of those stars is equal to the median 
thin disk metallicity, [Fe/H] = -0.13 (Nordstrom et al. 2004; AUende Prieto et al. 2004), we obtain My = (7.42, 7.91, 
8.54). For the reddest data point with V — 1—2.0, My— 9.5, and we obtain My— 9.4:7. This excellent agreement 
suggests that the extension given by eq. I A4I is good to within ^^0.1 mag ioi g — i < 2.2. 

For redder colors {g — i > 2.0), we compared our results with the relation derived by Bochanski et al. (2008, in 
prep.), which is based on ground-based trigonometric parallaxes for nearby stars (Golimowski et al. 2008, in prep.). 
Assuming a median metallicity of [Fe/H] = —0.13 for these stars, we found that the performance of eq. IA4I starts 
deteriorating around g — i = 3.0. In the range (2.0 < g — i < 2.8), our relation agrees with the Bochanski et al. relation 
within 0.07 mag (rms) and ^0.03 mag (median), and maximum deviation <0.1 mag, evaluated on a grid with 0.01 
mag steps. A linear relation in the range 2.8 < g — i < 4.0 

M°{g -i) = -4.40 + 3.97 {g-i) (A5) 



is a much better approximation to the observed sequence than eq. IA4I (but for a detailed fit please consult Bochanski 
et al.). Note that for [Fe/H] = —0.13, this relation must be shifted by 4.64 mag to obtain Mr (see eq. lA2[) . 
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Fig. 21. — The top left panel shows the difference between the color-magnitude sequences from Beers et al. (2 000) for three metallicity 
values (soUd: [Fe/_ff] = -2.20; long-dashed: [Fe/_ff] = -0.71; dot-dashed: {FejH\ = -1-0.12), and egs. IAHA31 derived here. The three 
short-dashed lines shows analogous differences for the M3, M13 and M92 sequences from Clem, VandenBerg & Stetson (2008), as marked. 
The systematic differences for blue stars are due to age effects. The solid hues in the top right panel show Mr for a Girardi et al. (2004) 
model with [_Fe///] = —0 .68, eval uated for three ages, as marked. The models are offset by 0.2 mag to brighter magnitudes, to match Mr 
computed using egs. lAll and IA7I (sho wn as t he d ashed line). The vertical long-dashed line marks the turn-off color for disk stars. The 
dashed line is Mr computed using egs. lAll and lATI The bottom left panel shows the median differences between t he SP SS distance modulus 
for main-sequence stars (determined using the Beers et al. sequences) and the values estimated using egs. lAll and lATI color-coded as shown 
in the inset. The two methods agree at the ~0.1 mag level (the mean for the median diff eren ce per pixel; the distribution rms width is 
~0.2 mag). The bottom right panel shows implied metallicity, estimated using egs. lAll and lA7l for the two photometric parallax relations 
proposed by Juric et al. (2008; solid line: "bright" relation; dashed line: "faint" relation). At the blue end, they bracket the median halo 
metallicity ([Fe/_ff] = —1.50); at the red end they sample the thin/thick disk metallicity range. 



As an additional test of the relation derived here, we compare it to color-magnitude sequences measured by Clem, 
VandenBerg & Stetson (2008) for three clusters that have turn-off colors bluer than g — i = 0.6 (M3, M13 and M92). 
Their data were obtained in the SDSS "prime" system, and we used expressions from Tucker et al. (2006) to transform 
those sequences onto the SDSS native system. For g — i > 0.5, their sequences for M3 and M13 are in good agreement 
(< 0.2 mag) with our predictions, while for blue colors close to the turn-off color, they become progressively brighter, 
as expected (see the top left panel of Figure [21]) . For M92, discrepancies are larger than ~0.2 mag even for red colors 
(g — J ~ 1). However, based on photometric transformations from Tucker et al. (2006) and Ivezic et al. (2007), we find 
that the M92 sequence in the SDSS "prime" system from Clem, VandenBerg & Stetson (2008) and the M92 sequence 
in rhw Johnson system from VandenBerg & Clem (2003) are not consistent. For example, V = 20.9 a,t B — V = 0.6 
taken from VandenBerg & Clem implies r — 20.7, while data listed in Table 3 from Clem, VandenBerg & Stetson 
imply r — 20.45 at the corresponding color. We emphasize that the same photometric transformations result in good 
agreement for the other two clusters, and that the color-magnitude sequence for M92 from VandenBerg & Clem agrees 
with our relation to within 0.1 mag. 
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The top left panel of Figure [21] shows a comparison of the relation derived here with the three sequences from Beers et 
al. (2000). Similarly to the comparison with the Clem, VandenBerg & Stetson sequences, our relation predicts fainter 
magnitudes for blue turn-off stars, as expected. We emphasize that these differences are not due to errors in the 
color-magnitude sequences adopted by Beers et al., because they agree with other sources, e.g., with the VandenBerg 
& Clem (2003) data. Rather, the differences are due to our design choice to exclude from fitting the parts of the 
clusters' color-magnitude sequences that are too close to their turn-off color. 

Our results show that the Beers et al. spline interpolation of metallicity effects based on only three clusters performs 
remarkably well. The largest overall discrepancy between our photometric parallax relation and the three Beers et al. 
sequences for red colors {g — i > 0.6) is observed for 47 Tuc: for 1.0 < g — i < 1.8, the predicted M^ are too bright by 
0.4 mag. Since agreement at our fiducial g — i ^ 0.6 is satisfactory, this difference implies that the color-magnitude 
sequence is for 47 Tuc is steeper than for other clusters discussed here. This peculiarity of 47 Tuc has been known 
for some time, and may be related to its anomalous helium content (Demarque & McClure 1980; Hesser, Harris & 
Vandenberg 1987). We note that our relation predicts absolute magnitudes for red stars {B~V > 1) that are brighter 
by ^^0.3 mag than the data for extremely metal-rich {[Fe/H] = -1-0.37) open cluster NGC 6791 from VandenBerg & 
Clem (2003). 

Age effects and Comparison with Models 

By design, the photometric parallax relation derived here avoids the increased curvature of the color-magnitude 
sequence close to the turn-off color. Its blue edge is constrained by the parts of the M3 and M15 sequences that are 
redwards from their turn-off colors (see Table 6 and the top right panel of Figure E(I)l . For stars with turn-off colors, 
the predicted absolute magnitudes can be up to ~1 mag too faint. For example, for M5 turn-off stars selected by 
0.25 < g — i < 0.35 ((r) — 18.6), the difference between predicted and observed r-band magnitudes is well described 
by a Gaussian distribution with a mean of 0.22 mag and a = 0.49 mag, implying underestimated distances by 11%, 
on average. 

The effect of age on turn-off color and absolute magnitude, as a function of metallicity, can be gauged with the aid of 
model isochrones, e.g., such as those developed for SDSS photometric system by Girardi et al. (2004). While modeling 
difficulties prevent absolute normalization of such models to better than ~0. 1-0.2 mag even for hot stars (and much 
worse for stars with g — i > 1), their relative behavior, as a function of age, provides valuable guidance. The Girardi 
et al. models show that the turn-off color is bluer than g — i = 0.6 even for 13 Gyr old populations and a metallicity 
at the upper end of the range relevant here {[Fe/H] — —0.4). Hence, the adopted relation is insensitive to age effects 
for t; — j > 0.6. For g — i < 0.6, it needs to be corrected as a function of metallicity and age. 

The mean ages of halo and disk stars considered in this work can be estimated from the blue edge of their color 
distributions. The number of stars drops precipitously bluer than g — i ^ 0.25 for the low-metallicity subsample 
{Fe/H < —1, halo stars), and at 5 — i ~ 0.4 for high-metallicity subsample (disk stars). Interestingly, the Girardi 
et al. models suggest similar age for both subsamples: ~10 Gyr, with an estimated uncertainty of ~2 Gyr (due 
to metallicity and color zeropoint uncertainties; we adopted 0.2 dex and 0.05 mag, respectively). Motivated by this 
result, we derive an age correction appropriate for stars with median halo metallicity and age of '-^lO Gyr using the 
color- magnitude sequence for cluster M13 {[Fe/H] ~ —1.54). For 0.22 < g — i < 0.58 

AM^^^{g -i) = -2.17 + 6.64 {g - i) - 5.00 {g - if, (A6) 

which increases from at the red edge to -0.95 mag at g — i — 0.22, and has to be added to right-hand side of eq. IA3I 
This correction for age is not strictly applicable to stars with higher disk-like metallicity. However, the Girardi et 

al. models suggest that the error is small, < 0.2 mag for t/ — i > 0.45 (i.e. 0.05 redde r th an the turn-off color for disk 

stars), as illustrated in the top right panel of Figure [211 For this reason, we adopt eq. IA6I as a universal age correction 

for stars bluer than g — i < 0.58. 
Given different expressions for three color ranges feas. lAlllA41 and lA5[) and the above age correction, for convenience 

we fit a fifth-order polynomial to a vector of Mr values generated using the appropriate expressions for 0.2 < g — i < 4.0, 

with a step size of 0.01 mag. Our final expression 

M°(g - i) = -5.06 -I- 14.32 a; - 12.97 a;^ -I- 6.127x3 - 1.267 a;'* -I- 0.0967 a;^ (A7) 

where x — {g — i), reproduces individual Mr values with an rms of 0.05 mag and maximum deviation below 0.1 mag. 
Together with eqs. lA2l and lA3[ this is the final photometric parallax relation used in this work. 

We have compared a large number of Girardi et al. models that span the relevant range of metallicities (—2.3 < 
[Fe/H] < 0) and ages (1-13 Gyr) with the resulting photometric parallax relatio n. M odel predictions are in good 
agreement (an rms of ~0.1 mag) with the Mr vs. [Fe/H] dependence described by eq. lAH but the model Mr predictions 
are systematically too faint by ~0.2 mag (evaluated aX g — i = 0.7). Possible explanations for this difference are 1) the 
model stars are too small by ~10%, 2) the model g — i color is too red by 0.06 mag, and 3) the model [Fe/H] scale 
is offset relative to SDSS scale by ^--^0.3 dex to larger values. A plausible combination of these effects, e.g., an error of 
3% in sizes, 0.02 mag in color, and 0.1 dex in metallicity, brings data and models into agreement (the probability that 
all three effects would have the same sign is 12%). 

Comparison with SDSS Distances and JOS 

With the adopted age correction feq. lA6p . our final expression is expected to produce very similar distances to those 
published in SDSS Data Release catalogs for blue stars {g — i < 2). We have confirmed that this is the case: the 
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median offset of implied Mr evaluated in small bins oiu — g and g ~ r color (see the bottom left panel of Figure [21]) is 
-0.07 mag, with an rms of 0.06 mag. These differences are smaller than the intrinsic errors of the photometric parallax 
method (~0.1-0.2 mag). 

Using eqs. lA21lA31 and lA7[ we can now determine "effective" metallicity that the two photometric parallax relations 
proposed in Paper I correspond to, as a function of the g -- i color (sec the bottom right panel in Figure [21]) . As 
designed, those two relations bracket the median halo metallicity {[Fe/H] ~ —1.50) at the blue end, and sample the 
thin/thick disk metallicity range at the red end. 

In summary, the relations proposed here are in good agreement (< 0.1 mag) with the clusters MS and MIS at the 
low-metallicity end for g — i < 1.5, and with local stars with trigonometric parallaxes for g — i > 1.5. At a fiducial 
color g — i = 0.6, in the middle of the color range where photometric metallicity can be estimated, the rms scatter 
around the best-fit AMr vs. [Fe/H] curve is 0.08 mag. Even in cases of known peculiar behavior (e.g., 47 Tuc) and 
at the high-metallicity end (e.g., NGC 6791), discrepancies do not exceed 0.4 mag. Compared to the Beers et al. 
relations used by the SDSS, here we provide an estimate of the scatter around mean relations, a closed-form expression 
for the metallicity dependence, and extend the method's applicability farther into the red, to g — i ~ 4. Given the 
larger number of globular clusters observed in SDSS system used here, as well as tests based on external data sets, 
it is likely that distance estimates for main-sequence stars based on the photometric parallax method (both using 
relations derived here and the Beers et al. relations) do not suffer from systematic errors larger than ^10%. While 
these systematic distance errors are not overwhelming, they could, in principle, have an impact on the analysis of the 
Milky Way kinematics. We discuss such issues further in Paper III (BOS). 

ADDITIONAL DISCUSSION OF THE PHOTOMETRIC TEMPERATURE ESTIMATOR 
Often, the inverse of the effective temperature is fit as a linear function of color (e.g. CPF). The best- fit 

5?^ = 0.532(5 - r) + 0.654 (Bl) 

Jeff 

results in the same systematic errors and rms scatter as eq. [31 with the largest difference between the two relations 
below 50 K. 

A lower limit for the errors in estimation of the photometric effective temperature can be readily computed using 
eq. [Hand the photometric errors in the g — r color (the median value is 0.025 mag, and 0.03 mag a,t g = 19.5; these 
values, computed by the photometric pipeline, are reliable, as discussed in detail by Sesar et al. 2007). This is a 
lower limit, because the contribution of errors in the spectroscopic effective temperature is not included. The standard 
deviation for the distribution of metallicity residuals normalized by these errors is 1.2. Hence, one is tempted to 
conclude that the accuracy of the effective temperature estimator is limited by the SDSS photometric errors. However, 
this conclusion is not consistent with the behavior of the log(Toff) vs. g — r relation for a subset of 13,719 stars 
for which more accurate photometry, based on ~10 repeated SDSS observations, is available (Ivezic et al. 2007). 
Although for these stars the median error in the g — r color is only 0.008 mag, the standard deviation for log(reff) 
residuals is not appreciably smaller (the expectation is a decrease by a factor of three). Therefore, it is quite likely 
that the contribution of errors in the spectroscopic effective temperature to the scatter of log(Toff) residuals is not 
negligible. Indeed, the implied value of ~100 K agrees well with an independent estimate based on a comparison to 
high-resolution spectral data, as discussed by Beers at al. 2006. The analyzed color range spans about 15 MK spectral 
subtypes (from ~F5 to ^^097X0, Bailer-Jones et al. 1997, 1998). Hence, the uncertainty in the photometric effective 
temperature estimate of 100 K corresponds to about one spectral subtype, or equivalently, an error of one spectral 
subtype corresponds to a g — r error of 0. 02 mag. 

A good correlation between the spectroscopic effective temperature and g — r color extends beyond the restricted 
color range where the photometric metallicity method is applicable (0.2 < g — r < 0.4). We find that everywhere in 
the —0.3 < g — r < 1.3 color range (roughly —0.1 < B — V < 1.3), the relation 

log(reff/K) = 3.882 - 0.316(g - r) + 0.0488(.g - r)^ + 0.0283(.g - rf (B2) 

achieves systematic errors below 0.004 dex and overall rms of 0.008 dex. The corresponding temperature range is 4,000 
- 10,000 K. Whe n th e residuals are binned in 0.1 dex wide bins of metallicity and log(g), the largest median residual 
is 0.006 dex. Eq. IB II remains valid in the —0.3 < g — r < 0.8 range, but also requires non-linear terms if extended to 
redder colors (or a different linear fit for the 0.8 < g — r < 1.3 range). 

Due to the expanded g — r range, the impact of metallicity and log(g) on log(T'off) residuals is expected to be larger 
for this relation than for eq. [3l Using Kurucz (1979) models, we find that the strongest dependence on metallicity is 
expected in the 0.4 < g — r < 1.2 color range, with a gradient of ~0.015 dex/dex (see also Lenz et al. 1998 for a related 
discussion). The measured value for SDSS sample is 0.012 dex/dex, and implies up to ~200 K offsets as metallicity 
varies from —2.0 to —0.5. The strongest dependence on log(g) is expected in the —0.2 < g — r < 0.1 color range, with 
a gradient of 0.02 dex/dex. The measured value for the SDSS sample is ^--^0.004 dex/dex, or about five times smaller 
(150 K vs. 720 K variation, as log(g) varies by 2 dex at 5 — r = 0). We do not understand the cause of this discrepancy. 

ADDITIONAL DISCUSSION OF THE PHOTOMETRIC METALLICITY ESTIMATOR 

In two restricted color regions, simpler expressions than eq. [H can suffice. In a low-metallicity region defined by 
0.8 < u — g < 1.0 (and 0.2 < g — r < 0.6, of course), the relation 

[Fe/HU^5.Uiu~g)~6.10 (CI) 
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reproduces the spectroscopic metallicity of about 27,000 stars with an rms scatter of 0.29 dex. We note that this 
is essentially the same expression as obtained by Ivezic et al. (2006b), using a preliminary version of spectroscopic 
parameter pipeline, except for an overall shift in metallicity by 0.2 dex. This shift is due to the fact that SDSS stellar 
parameters pipeline was still under development when the analysis of Ivezic et al. (2006b) was performed. 
In the high effective temperature region (5800 K < Toff < 6600 K) defined by 0.2 < g — r < 0.4, the relation 



[Fe/H]ph = -21.88 + 47.39(u - g) - 35.50(u - g)^ + 9.018(u - g)^ 



(C2) 



reproduces spectroscopic metallicity of about 34,000 stars with an rms scatter of 0.30 dex. In the range 0.8 < u—g < 1.4 
(corresponding to —2.0 < [Fe/H] < —0.4) systematic errors do not exceed 0.1 dex. The systematic errors are larger 
than for eq. |4] because the lines of constant metallicity in the 5 — r vs. u — g diagram are not exactly vertical. Despite 
having somewhat poorer performance, eqs. ICll and IC2I are convenient when estimating the impact oi u — g color 
error on photometric metallicity error. An error in the u — g color of 0.02 mag (typical of both systematic calibration 
errors and random errors at the bright end for SDSS data) induces an error in [Fe/H] that varies from 0.02 dex at 
[Fe/H] = -0.5 (m - 5=1.28) to 0.11 dex at [Fe/H] = -1.5 {u - g=0.89). At g = 19.5, the median u- g error 
for single-epoch SDSS data is 0.06 mag for point sources with 0.2 < g — r < 0.4, corresponding to median random 
metallicity errors of 0.10 dex for disk stars and 0.30 dex for halo stars (for a detailed dependence of SDSS random 
photometric errors on magnitude, see Sesar et al. 2007). 

The metallicity vs. u — g relation has a smaller slope at the red end (both eq.|4]and eq. lC2p . and effectively introduces 
an upper limit on estimated metallicity. For example, for u — 5=1.3 and g — r — 0.4, [Fe/H]ph = —0.44 (from eq. HI 
and —0.46 using eq. IC2[) . Such an upper limit is in agreement with the data analyzed here, but we emphasize that 
the data set under consideration does not include significant numbers of stars with higher metallicity. Such stars are 
presumably nearby thin-disk stars, which in the 0.2 < g — r < 0.6 range are typically saturated in SDSS data (most 
SDSS data to date are obtained at high Galactic latitudes). It is thus possible that metallicity estimates given by 
both eq. [4]and eq. lC2l would be biased towards lower values for stars with [Fe/H] > —0.5, resulting in a "metallicity 
compression". Some evidence that this is a detectable, but not a major effect is discussed in S 13.4. 11 and in more detail 
by Lee et al. (2007b). It will be possible to quantify this effect in detail using the data for metal- rich stars from the 
ongoing SDSS spectroscopic survey of low Galactic latitudes (SEGUE). 

Given that the u-band photometric errors limit the precision of photometric metallicity estimates at the faint end, 
it is prudent to test whether the position of a star in the r — i vs. g — r color-color diagram could be used as an 
alternative method. We selected subsamples of stars in 0.02 mag wide g ~ r bins, and inspected the dependence of 
spectroscopic metallicity on the r — i color in the range —2.5 < [Fe/H] < —0.5. The strongest correlation between 
[Fe/H] and r — i color is observed around g — r ^ 0.4, with a gradient of A(r — i)/A[Fe/H] ^ 0.017 mag/dex. Hence, 
the effect of metallicity on the r — i color is about 10 times smaller than for the u — g color. With the r — i color 
kept fixed, we find A{g — r)/A[Fe/H] ^ 0.04 mag/dex. When using only the gri bands, the photometric metallicity 
errors are about 0.3 dex at the bright end and 0.5 dex at 5 = 19.5 (<0.1 dex and <0.3 dex for ugr based estimates). 
Therefore, the best approach for estimating photometric metallicity using SDSS data is to use the ugr bands. 
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